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PREFACE 


Recent decades have seen extensive application of mathematics 
in all spheres of man’s activity. The use of mathematics as a 
tool for simulating the world around us has increased, and as 
a result a mathematical education has become ever more 
necessary. 

“Linear algebra with elements of analytic geometry” is a bas- 
ic course for mathematics students in technical schools. The 
topic is important because most applied problems are either 
“linear” in nature or admit of “linearization”. 

The concept of a vector underlies linear algebra. In a narrow 
sense, a vector is an arrow or a directed line segment in the 
plane or in space. However, mathematicians have extended the 
concept and this in more general interpretation covers a wider 
circle of objects. 

The vector is primarily used in analytic geometry. The basic 
idea in this branch of mathematics is the coordinates, that is, 
numbers defining the position of a geometric object. The coor- 
dinate method was developed to solve geometrical problems 
and then extended to all fields of mathematics and so it became 
a universal tool of its application. 

The union of two main ideas, those of coordinates and a 
vector, underlies the course on analytic algebra with the ele- 
ments of analytic geometry. The mathematical methods a stu- 
dent masters when studying the course will be of practical use 
in all applications of mathematics. 

This study aid is in two parts. The first part presents the 
elements of analytic geometry and the theory of determinants, 
while the second part is devoted to linear algebra, namely, sys- 
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tems of linear equations, matrix theory, and vector, Euclidean, 
and affine spaces. The concepts of linear algebra are mainly 
studied using arithmetic vector spaces R” (n= 1, 2, 3), 
although Chapter 10 also presents abstract vector spaces. All 
problems of linear algebra are in any case reduced to analyzing 
systems of linear equations. We use Gaussian elimination for 
this purpose. We believe that this approach best serves the aim 
of the course, which is to teach the student methods for solving 
linear problems avoiding wherever possible abstract notions. 

The book is oriented to high-school graduates. Neverthe- 
less, a small part of the material demands more of the student, 
namely Chapter 5 (mth-order determinants), Section 10.5 (ab- 
stract vector spaces), and Chapter 13 (affine spaces). Each 
chapter (except for Chapter 8) ends with exercises. 

When selecting the material for this study aid, we were guid- 
ed by the course curriculum and a desire for simplicity of 
presentation. 

We would like to express our appreciation to the reviewers 

rofessor O. V. Manturov and Z. M. Egozar’yan, who com- 
piled the course curriculum, for their constructive and valuable 
suggestions. 


The authors 


Part One 
ANALYTIC GEOMETRY 


Chapter 1 


VECTORS IN THE PLANE AND IN SPACE. 
CARTESIAN COORDINATE SYSTEM 


1.1. VECTORS 


1. Notion of a vector. 
Many physical quantities encountered in mathematics and its 
applications, such as the length of a line segment, the area of 
a figure, the volume and mass of a body, are completely 
described by real numbers, their magnitudes. These quantities 
are called scalars. Other physical quantities, such as force and 
velocity are not completely determined by a number, they also 
possess direction. Such quantities are called vectors. 
Definition. A directed line segment or vector is a line seg- 
ment whose endpoints are specified. A vector is directed from 
its initial to its terminal point. 
Vectors are represented geometrically as arrows (Fig. 1). The 
tail of the arrow is called the initial point of the vector, and 
. the tip of the arrow the terminal point. A vector whose. initial 
point is 4 and whose terminal point | is B is denoted AB, and 
its length or magnitude is denoted |AB|. Another notation of 
a vector is lowercase boldface type such as a and its magnitude 
is denoted |a| or a. The initial point of the vector AB (point 
A) is sometimes called the point of its application. - 


B 


terminal point 


initial point 


Figure 2 


Figure 1 
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Definition. Vectors lying on the same line or on parallel lines 
are called collinear vectors. 

Any two of the three vectors in Fig. 2 are collinear to each 
other. 

Definition. Vectors which are collinear, have the same length, 
and are in the same direction are called equivalent. 

If vectors are equivalent, we write AB — CD. 

The definition of vector equality implies that the point of 
application of the vector can be arbitrarily chosen. In this 
sense, vectors are often called free. The vectors AB and CD 
shown i in Fig. 3; are equivalent, while the vectors MN and MP 
and EF and GH are not. 

If the initial point of a vector coincides with its terminal 
point, the vector is represented by a point and have no specific 
direction. Such a vector is called the zero vector and is denoted 
by 0. Thus, the equality a = 0 is read as “the vector a equals 
zero", which means that the initial and terminal points of a 
coincide. The magnitude of the zero vector is zero. 

From the definition of collinear vectors we infer that the zero 
vector is collinear with any vector. 

2. Addition of vectors. 

A course of high-school mathematics covers the operation 
of vector addition. Let us recall the geometric interpretation 
of vector addition, that is, the triangle law of addition. 

If a and b are any two vectors, then the sum a + b is the 
vector constructed as follows (Fig. 4). Place the initial point of 
a vector a’, which is equivalent to a, at any point O and posi- 
tion a vector b', which is equivalent to b, so that its initial 
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Figure 4 


point coincides with the terminal point of a’. The vector a + b 
is the arrow from the initial point of a’ (ie the point O) to 
the terminal point of b’. 

We emphasize that the vector a + b so constructed does not 
depend on the position of the point O. If we take any other 
point, say O*, and perform the same construction, we obtain 
the same vector a + b (Fig. 5). 

If vectors a and b are not collinear, we can apply the 
parallelogram law of addition. To obtain the vector a + b, we 
place the initial points of a and b at the same point O and 
construct a parallelogram (Fig. 6). The diagonal from the point 
O is the vector a + b. 

The parallelogram law is often applied in physics problems, 
for instance, when composing forces, i.e. to find the resultant 
force. 

Here are some properties of vector addition familiar from 
high-school mathematics. 

1°. Commutativity, 


a+b=b+t+a (1) 


Figure 5 Figure 6 
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Figure 7 


Figure 8 


The property is illustrated in Fig. 7 when a and b are not 
collinear. 
29. Associativity, 


(a+ b)+c=a+ (b+c) (2) 


(see Fig. 8). 

The two properties imply that the sum of any number of 
vectors is not affected by the order of addition of the methods 
of grouping. 

To construct the sum a, + 32 + ... + an, it is most con- 
venient to apply the rule of closure of an open polygon, which 
is a generalization of the triangle law and is as follows. If we 
go from the initial point of the vector a; to the terminal point 
of the vector a, by way of vectors a), a2, ..., a4, then the vector 
from the initial point of a; to the terminal point of a, is their 
sum a; + a+... + an (Fig. 9). 

3. Vector subtraction. 

Definition. Given any two vectors a and b, their difference 
b — a isa vector d, such that when added to the vector a yields 
the vector b. 


Q,+a2+...+ Qn 0 a A 


Figure 9 Figure 10 
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Figure 11 Figure 12 


Hence follows the rule for vector subtraction. We position 
vectors a and b so that their initial points coincide (point O, 
see Fig. 10), the vector from the terminal point (A) of a to the 
terminal point (B) of b is the vector b — a. Indeed, if a= OA 
and b = OB, then OA + AB = OB, ie. AB = OB — OA, or 


AB-b-a 
The vector with the same length as the vector a but opposite- 


ly directed is called the negative of (opposite to) a and is denot- 
ed by —a. We can easily see that 


a+(-a)=0 (3) 
Obviously, if a = AB, then —a = BA, and we have (see Fig. 11) 
a +(-a) = AB + BA = AA - 0 
Formula (3) yields the following for the difference of two 
vectors 
b—a=b + (-a) (4) 
Indeed, if we add the vector a and the vector b + (— a), we get 
a+ [b + (-—a)] = b+ [a+(-a)] =b+0=b 

Formula (4) is most useful when we add or subtract more 

than two vectors. For instance, to find the difference a — b — 

€ — d, we should use vectors a, —b, —c, —d to construct their 
sum according to the rule of closure of an open polygon. 


4. Multiplication of a vector by a scalar. 
If a is a vector and ) is a number (scalar), then the product 
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Aa 


(-A]a 


Figure 13 Figure 14 


^a is a vector which is collinear to a, has the length |A|-|a|, and 
has the same direction as a if A > 0 and is opposite to a if 
^ < 0. Here |A| denotes the absolute value of the number A. 
The product 0a is equal to zero (zero vector). 

If ^ > 0, the multiplication of a by ^ can be visualized as 
follows: the vector ^a is the result of "stretching" the vector 
aà times. However, we mean a mathematical stretching, so that 
if, for instance, \ = 1/2, stretching ^ times means that the 
length of a is reduced to one half. If À < 0, stretching |A| times 
is accompanied by changing the direction of a to the opposite 
(Fig. 12). 

The operation of vector multiplication by scalars possesses 
the following property: 


d(ua) = Qu) a (5) 


The vectors \(ua) and (A4) a have the same magnitude equal 
to |A||||a], have the same direction as a if \ and y are of the 
same sign and have the opposite direction if à and p are of 
opposite signs. If either \ or p is zero, both vectors A(ua) and 
(Au) a are zero. 

Here is another property of the operation of vector multipli- 
cation by a scalar: 


-0a)-2(-Xa (6) 


This property is illustrated in Fig. 13. 

Theorem (on collinear vectors). Zf a and b are two collinear 
vectors and a is nonzero, then there is a unique number ) such 
that b = da. 
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n If b = 0, then à = 0. Now let b 0. Let A = |b|/la| if the 
vectors b and a are in the same direction, and \ = — |b|/la| if 
they have opposite directions. Then b = da. 

If b = ya along with b = da, then (A — p) a = 0, and since 
a;:20, we haveX — p = 0, ie A — g. m 

5. Linear operations on vectors. 

The operations of vector addition and multiplication by a 
number are called /inear operations. (The operation of subtrac- 
tion is defined via addition and thereby is considered to be 
*secondary".) 

In the school course the following two properties, each in- 
volving both linear operations, were proved: 


(+p) a= dat pa (7) 
A(a + b) = da + Xb (8) 


Here a, b, i, and y are arbitrary. Figure 14 illustrates Eq. (8) 
for A » 1. 

Properties (1) through (8) are important since they make it 
possible to perform calculations in vector algebra basically as 
in common algebra, for instance, we can use the rules of arith- 
metic for removing brackets and factoring out. 

6. A Linear combination of several vectors. 

Suppose we have several vectors ar, a2, ..., an. 

Definition. A linear combination of vectors ai, 32, ..., An 
is the sum of the products of these vectors multiplied by any 
numbers ci, C2, ..., Cn. 


For instance, 3a — E b + 7c is a linear combination of the 


vectors a, b, c. 

Example. A quadrilateral ABCD is given, and P and Q are 
the midpoints of the sides BC and AD ) respectively (Fig. 15), 
express the vector PO via the vectors AB, BC, CD. 

© We have 


—R 


A 


—. 


PQ = PB + BA + AQ = - 


—> — 


BC — AB + 


— 


Since AB + BC + CD + DA = AA =0 and hence DA = 


| 
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G 


Figure 15 


-(AB + BC + CD), we get 


PQ = - 5 BC - AB +} ( 
a bape ee 
= - } Ab + } b 


nol m 


Figure 16 


We see that the expression does not contain the vector BC. e 
7. A line segment divided in a ratio. 
Let a point C divide a line segment AB (see Fig. 16) in the 


n a is 
ratio — , ie. 


B 


Then, 


Q) 


We connect points A, B, and C with a point O and formulate 
the following problem: express vector OC in terms of vectors 


OA and OB. 


O We have AC = OC — OA and CB = OB - OC. Mul- 
tiplying both sides of the first equation by 8 and both sides 


of the second by o, we have 


BAC = (OC — OA) and aCB = a(OB — OC) 
Since, by virtue of (9), the left-hand sides of the equalities are 


2— 366 
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Figure 17 


equal, their right-hand sides are also equal 
B(OC — OA) = a(OB - OC) 
We write this equation as (œ + 8) OC = BOA + aOB, whence 


m B om d 
OOS gan er er: 
Formula (10) is widely used in calculations. 

Example. Prove that the medians of a triangle intersect at 
one point, which is the centroid of the triangle, and that this 
point divides each median in the ratio 2:1 reckoning n the 
vertex along the median. 

O We choose an arbitrary point O in the plane of the trian- 
gle. Then consider point P which divides median AA’ in the 
ratio 2:1 (Fig. 17). By formula (10) we have 


OB ® (10) 


y 1 — 2 T, 
POI XU TY led 
Bed la 
OA x. B+, C 


Whence 


Ee 
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Figure 18 


Vectors OA , OB, and OC are in equal proportion in this expres- 
sion. Hence it is clear that if we take points Q and R which 
divide medians BB' and CC" respectively i in the ratio 2:1, we 
obtain similar expressions for vectors OQ and OR. Conse- 
quently, OP = OO = OR, which proves that points P, Q, and 
R coincide. 6 


Here is another example. It generalizes the preceding example and represents 
Ceva's theorem from elementary geometry, viz. if the sides of a triangle are 
divided in the ratios «:8, y:a, B:y (see Fig. 18), then the straight lines joining 
the vertices to the points of division of the opposite sides intersect at one point. 

a We consider point P which divides the line segment AA’ in the ratio 
(8 + yXo. From formula (10) we have 


OP = — BA. y gp 
a+ß+y a+Bty 

= a gyp eee 8 B+ 2 oC 
a+B+y a+B+ty\Bry Bty 


= l — (aOA + BOB + yOC) 
a+Bt+y 


Thus it is clear that if we take point Q, which divides line segment BB’ in 
the ratio (a + y):8, and point R, which divides line segment CC’ in, the ae 
(œ + 8):y, then we obtain n similar expressions for vectors OQ and OR. Conse- 
quently, OP = OQ = OR, which proves that points P, Q, and R 
coincide. mw 

The theorems on the intersection of bisectors and on the intersection of the 
altitudes of a triangle at one point are special cases of Ceva's theorem. 
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1.2. VECTOR BASIS IN THE PLANE AND IN SPACE 


1. Vector basis in the plane. Representation of a vector in 
terms of the basis. 
Lemma. /f vectors a and b are not collinear, then the 
equation 
aa + Bb =0 (1) 


holds true if and only if both o and Q are zero. 
o Let, say, a #0. Then (1) yields 


and this contradicts the hypothesis that a and b are not col- 
linear. Therefore, œ = 0. We can prove that 8 = 0 in a similar 
way. m 

Let us now consider a plane in space. 

Definition. A vector basis in a given plane is a set of any 
two noncollinear vectors ej and ez in the plane. 

The vectors e; and e2 are termed the first and the second 
basis vectors respectively. 

We prove the following important theorem. 

Theorem. Let ei, e be a vector basis in the plane. Then any 
vector a in the plane can be uniquely represented as a linear 
combination of the basis vectors e and e; 


a= Xe, + Ye; (2) 


o We reduce the vectors ei, e2, and a to a common origin 
O and draw a straight line through point A (the terminus of 
a) and parallel to e2. Let A: be the point of intersection of 
the straight line and the axis of ei (Fig. 19); point A; does exist 
since _the vectors e; and e? are not collinear. We have 
a= OA, + AiA. By the theorem on collinear vectors, 
OA 1 = Xe, and A\A = Ye, where X and Y are numbers, 
whence a = Xe; + Yez, and this is the representation of the 
vector a as a linear combination of the vectors ej and ez. 
We now prove that this representation is unique. Let the 
equality 
a= X*e + Y*e; (3) 
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Ao A 


ea 


Figure 19 


as well as Eq. (2) be true. Subtracting (2) from (3) we have 
(X-X*)e-(Y- Y9e-0 


According to the above lemma this means that X — X* and 
Y S320 

Definition. Equality (2) is the representation of the vector 
a in terms of the basis ej, e2 and the numbers X and Y are 
called the coordinates of a in the basis ei, e; (or in terms of 
the basis ei, e). 

This definition of vector coordinates implies that equal vec- 
tors have equal coordinates. Indeed, if a = Xe, + Ye; and 
a’ = X’e, + Y’e, then the equality a = a’ implies that 


(Oa tty’ = ra =a 


whence from the lemma we have X’ = X and Y’ = Y. 

For the sake of brevity we accept the following notation: if 
X and Y are the coordinates of the vector a in the basis ei, 
e2, we shall write a = (X, Y) in the basis ei, e»; if the basis 
is known, we shall simply write a = (X, Y». 

Let us return to the proof of the representation theorem. A 
direct corollary of the proof is the following proposition: let 
vectors ei, ey, and a be reduced to a common origin O. Through 
the terminus of a we draw a straight line parallel to the vector 
€i. Let A» be the point of intersection of the straight line and 
the axis of the vector e2. Then OA; = Yeo, where Y is the se- 
cond coordinate of a in the basis ej, e. m 

Indeed, _We can see from Fig. 19 that OÀ; = A14; conse- 
quently, OA; = Ye. 
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Figure 20 


By changing the indices of the basis vectors we can always 
get ei to be the second basis vector (we assume that ej = e2, 
ez = ei). Hence it is clear that we have an analogous proposi- 
tion for the first coordinate of the vector a: through the termi- 
nus of a we draw a straight line parallel to the vector e;. Let 
A, be the point where the straight line intersects the axis of 
the vector ey. Then OA | = Xe), where X is the first coordinate 
of a. 

Example. Given a parallelogram ABCD (Fig. 20). The points 
P and Q are the midpoints of the sides BC and CD respectively, 
des the coordinates of the vector PO if we assume that 

= AD and e2 = AB are the basis vectors. 


O We have PQ = PC + CO = 5 AD-} AB= Se, = 
1e. Hence PO = (1/2, —1/2) in the basis ei, e. @ 


2. Coplanar vectors. 

We say that the vector AB is parallel to a given plane a if 
the straight line AB containing this vector is parallel to the 
plane. A zero vector is assumed to be parallel to any plane. 

Definition. Several vectors a, b, c, ... in space are called 
coplanar if they are parallel to the same plane. 

If vectors OA, OB, OC, .. with the common origin O are 
coplanar, then the points A, B, C, ... are obviously in the same 
plane. In this sense we can say that coplanar vectors can be 
translated to the same plane. 

Example. Consider a pyramid with the vertices A, B, C, D 
(Fig. 21). The vectors BC, CD, DB are obviously coplanar, 
while the vectors AD, AB, AC are not, neither are the vectors 
AB, AD, DC: otherwise the point C would lie in the plane 
ABD 
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Figure 21 Figure 22 


Two vectors are always coplanar, while three vectors can be 
noncoplanar. If the vectors OA, OB, OC are not coplanar, we 
can obtain three distinct planes by drawing through the termi- 
nus of each vector a plane parallel to the two remaining vectors. 
These three planes together with | three other planes, which are 
defined by the pairs of vectors OA and OB, OA and OC, OB 
and OC, respectively, enclose a parallelepiped (Fig. 22). If P 
is a vertex of the parallelepiped, then 


OP = OQ + OP = OQ + OC = OA + OB + OC. 


This expression gives : a way of f constructing the sum of three 
noncoplanar vectors OA, OB, OC: we consider the parallele- 
piped with vertex O and the line segments OA, OB, OC as its 
three edges; then the vector OA + OB + OC isa principal di- 
agonal of the parallelepiped starting at the vertex O. 

3. Vector basis in space. Representation of an arbitrary vector 
in terms of the basis. 

Lemma. 7f three vectors a, b, and c are noncoplanar, then 
the equality 


ca + fb + yc =0 (4) 


holds true if a = 0, 820, y = 0. 
Uu Let, for example, a # 0. Then from (4) we have 


LB, dx (5) 
a a 
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Figure 23 


If vectors b and c are collinear, then (5) implies that all three 
vectors a, b, and c are collinear, and hence are coplanar. 1f b 
and c are not collinear, then after being reduced to a common 
origin they will define some plane; then by (5) the three vectors 
a, b, and c are parallel to that plane. Thus the assumption that 
a # Oled us to a contradiction with the hypothesis. This means 
that a = 0. Similarly we can prove that 8 = 0 and y = 0. m 

Definition. A vector basis in space is a set of any three non- 
coplanar vectors ej, ej, and es. 

The vectors ei, e, and e; are called the first, the second, 
and the third basis vectors. 

Theorem. Let ei, e;, and e; be a vector basis in space. Then 
any vector a in space can be uniquely represented as a linear 
combination of the basis vectors ey, ez, and es: 


a = Xe, + Ye + Ze3 (6) 


o We reduce the vectors ej, e;, es, and a to a common origin 
O and denote the plane defined by the vectors e; and e; by 
a. We draw through a point A (the terminus of a) a straight 
line / parallel to e3 (Fig. 23). We designate the point where the 
straight line intersects the plane a as P (the point P does exist 
since the straight line / is not parallel to o). We have 


a= OA = OP + PA. 
By virtue of the theorem proved in Sec. 1.2.1 the vector OP 


can be represented as a linear combination ej and e: 
OP = Xe, + Ye; by the theorem on collinear vectors, 
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PA = Ze;. Hence we have a = Xe, + Ye; + Zes, and this 
proves that the vector a can be represented as a linear combina- 
tion of the vectors ej, e2, and es. 

The uniqueness of the representation can be proved in the 
same way as in the preceding theorem. m 

Definition. Expression (6) is the representation of vector a 
in the basis ej, ez, and ej, and the numbers X, Y, and Z are 
called the coordinates of a in the basis ei, €», es. 

As in the case of the plane, here we accept the following nota- 
tion: if X, Y, Z are the coordinates of the vector a in the basis 
€1, €2, es, then we shall write a = €X, Y, Z) in the basis ei, 
€2, €3. 

If the basis is predetermined, then we simply write a = (X, 
Y, Z}: 

We now turn to the proof of the last theorem again. A direct 
consequence of the proof is the following proposition: let vec- 
tors ej, €, es, and a be reduced to a common origin O. 
Through the terminus of a we draw a plane parallel to the vec- 
tors ey and e; and designate by A3 the point of intersection 
of this plane and the axis of e. Then OA; = Zes, where Z is 
the third coordinate of the vector a in i the basis ei, €, €3. 

Indeed, we can see from Fig. 23 that OA; = PA; consequent- 
ly, OA; = = Zez. 

By changing the indices of the basis vectors we can always 
get ez or ei to be the third basis vector (for example, assume 
that eí = ei, ej = es, ei = e). Hence it follows that proposi- 
tions similar to the one just proved are also valid for two other 
coordinates of the vector a. 

Let a plane be drawn through the terminus of the vector a 
parallel to vectors ey and es, and Az be the point where that 
plane intersects the axis of e2. Then OÀ; = Yeo, where Y is 
the second coordinate of a. 

Let a plane be drawn through the terminus of the vector 

a parallel to vectors e» and es, and let A, be t the point where 
that plane intersects the axis of ei. Then OA, = Xei, where 
X is the first coordinate of a. 

Example. A pyramid with vertices A, B, C, and D is given 
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Figure 24 


and P and Q are the midpoints of edges AD and BC respective- 
ly (Fig. : 24), find the coordinates of PQ in the basis e; = AB, 
€9 = AC, e3 = AD. 

We have 


— — 


PQ = PA + AC + CO- - 


zc AC l (CA oe ene 
TOS RT ACS (CA + AH) Eh gg 


Whence PQ = (1/2, 1/2, —1/2) in the basis ei, ej, ej. @ 
4. Operations on vectors defined by their coordinates. 
Assume that we have two vectors a and b such that a — (X, 

Y}, b = (X', Y’), in the vector basis ei, e2. Then we have 

a = Xe; + Ye, b = X'ei + Y'e, and therefore 


a+b=(X%+X’)e+(Y¥+ Y)e 
a-b=(X¥-X)a+(Y¥- Ye 

Thus 

at+b=(X+X', Y+ Y’), a-b-QX-X',Y- Y^» 


that is, when adding or subtracting vectors we add or subtract 
their respective coordinates. 

Similarly, if a = (X, Y), we can write a= Xe. + Yeo, 
`a = AXe; + AYe2, whence it follows that 


ha = (AX, XY) 


— mem 


| 
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that is, when multiplying a vector by a number we multiply 
its coordinates by that number. 

The rules of operations on vectors in space are the same as 
those in the plane; if a = (X, Y, Z) and b = (X’, Y’, Z’), 
then 


atb=(X4+X', Y-Y,Z-«Z» 
a-b=(X-X’, Y-Y’,Z-Z’') 
`a = (XX, AY, AZ) 


We can prove these equalities in the same way we did in the 
case of the plane. 

Example. Let a = (—1, 2, 5), b = (1, 3, 7). Find the vector 
Ta — Sb. 

O We have 7a = ¢ —7, 14, 35), 5b = <5, 15, 35), 7a — 5b = 
(-12, -1, 0). @ 

5. The condition of the collinearity of vectors in coordinates. 

Suppose that a = (X, Y) and b = (X', Y’) are two vectors 
in the plane, with a z 0. If b is not collinear with a, then 
b = M, where ^ is a number. Since equal vectors have equal 
coordinates, we have 


X'2AX, Y' =N (7) 


Conversely: If equalities (7) are true, then b = a, that is b 
is collinear with a. 

Thus, the vector b is collinear with the nonzero vector a if 
and only if the coordinates of b are proportional to the respec- 
tive coordinates of a. 

The same conclusion is also valid for vectors in space. 

If none of the coordinates of vector a is zero, the condition 
that vector b is collinear with vector a can be written in the 
form 


in the plane, and as 


in space. 
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For instance, the vector b = ( —2, 6, 4) is collinear with the 
vector a = (—3, 9, 6) since —2/—3 = 6/9 = 4/6. 

Example. Verify that the two vectors a = (—1, 3) and 
b = (2, 2) in the plane are noncollinear and express the vector 
c = (7, —5) in terms of the basis a, b. 

© Since —1/2 z 3/2, a and b are noncollinear. Consequent- 
ly, they form a basis in the plane. Let c = Xa + Yb. To find 
X and Y, we equate the respective coordinates of the vectors 
c and Xa + Yb and obtain the following system of two equa- 
tions in two unknowns X and Y: 


72 X(-D)4 ¥-2-2=X-3+ Y2 
By solving it we find X = —3, Y = 2. Thus, c = —3a + 2b. 6 


1.3. CARTESIAN COORDINATE SYSTEM ON A STRAIGHT 
LINE, IN THE PLANE, AND IN SPACE 


Suppose that O is a fixed point and we call it the origin. 
If M is an arbitrary point, then the vector OM is called the 
radius vector of M relative to the origin O, or in short, the 
radius vector of M. 

1. Cartesian coordinates on a straight line. 

Suppose that we have a straight line / in space. We choose 
the origin O on that line and also a nonzero vector e which 
we shall call a basis vector (Fig. 25). 

Definition. The set (0, e} of the point O and the basis vector 
e is called a Cartesian coordinate system on the straight line. 

We consider : an arbitrary point M on the straight line /. Since 
the vectors OM and e are collinear, we have OM - xe, where 
x is a number. We call this number the coordinate of M on 
the straight line. The coordinate of the origin O is zero. AII 
the points M on the line have either positive or negative coor- 
dinates depending on whether the directions of OM and e coin- 


[^] e M 
Oo ë > 
x 


Figure 25 
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Figure 26 


cide or are opposite. The straight line / on which we have in- 
troduced coordinates is called a coordinate axis or the x-axis. 

Assigning coordinates on the straight line leads to each point 
M on the line being associated with a unique number x, the 
coordinate of the point. Conversely: an arbitrary number x cor- 
responds to a unique point M for which the number is its coor- 
dinate. 

2. Cartesian coordinate in the plane. 

We choose an origin O in the plane and two collinear vectors 
e; and e; which form a vector basis. 

Definition. The set (O, ei, e2} of the point O and the vector 
basis ei, e; is called a Cartesian coordinate system in the plane. 

Two straight lines which pass through O parallel to the 
respective vectors e; and e» are called the coordinate axes, the 
first being the abscissa or the x-axis and the second, the or- 
dinate or the y-axis. 

We shall always depict the vectors e; and e; lying along the 
respective coordinate axes (Fig. 26). 

If the Cartesian coordinate system is given in the plane, then 
the position of a point M with respect to that system can be 
defined by two numbers x and y, which are the coordinates of 
M in the system. 

Definition. The coordinates of the point M in the plane with 
respect to the Cartesian coordinate system (O, ei, ez] are the 
coordinates of its radius vector OM in the basis ei, e. 

In other words, to find the coordinates of a point M, we 
should express the vector OM in terms of the basis ei, e: 


OM = xe; + ye 
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Figure 27 


The numbers x and y are the coordinates of M with respect 
to the Cartesian coordinate system (O, ei, e2}. 

The x coordinate is called the abscissa of the point M, and 
the y coordinate the ordinate of M. The coordinates of a point 
are usually given in parentheses. In compact notation, we speak 
of “the point M(x, y)". 

Thus, every point M of the plane with the fixed coordinate 
system (O, ei, ez] is associated with an ordered pair of num- 
bers (x, y). Conversely, every ordered pair of numbers (x, y) 
is associated with the only point M in the plane; this point is 
the terminus of the vector OM = xe, + ye. 

The coordinates of the origin O are 0, 0. The coordinates 
of points lying on the abscissa axis are x, 0, and the coordinates 
of points of the ordinate axis are 0, y. The remaining part of 
the plane is separaied by the coordinate axes into four compart- 
ments called quadrants (Fig. 27): we have x > 0, y > 0 for the 
points of the first quadrant, x < 0, y > 0 for those of the sec- 
ond quadrant, x « 0, y « 0 for those of the third quadrant, 
and x > 0, y < 0 for those of the fourth quadrant. 

Figure 28 shows several points with their coordinates. 

Coordinate systems underlie the methods of analytic geome- 
try and reduce any geometric problem to that of arithmetic or 
algebra. First, we arithmetize initial data. For example, to speci- 
fy a point we define its coordinates. The solution of the 
problem is also arithmetized (we will touch upon this subject 
below). The final result also has an arithmetic form; for in- 
stance, a point is found when its coordinates are found. 
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Figure 28 


3. Coordinates of a vector in Cartesian coordinate systems. 

Definition. The coordinates of a vector a in the plane with 
respect to the Cartesian coordinate system (O, ei, e2} are the 
coordinates of this vector in terms of the basis ei, e2. 

In other words, in order to find the coordinates of the vector 
a, we should express it in terms of the basis ei, ex: 


a= Xe, + Ye; 


the coefficients X and Y are the coordinates of a relative to 
the Cartesian coordinate system (O, ei, e2}. 

Let us consider the following important problem. 

Problem. Given two points A (x, a Y), B(x’, y^), find the coor- 
dinates X and Y of the vector AB. 

O We have OA = xe + ye, OB-x'ei + y'e. By 
subtracting the first equality from the second we get AB = 
(x' — x) e + Q' — y) e2. Consequently, 


X-x'-x Y=y’'’-y @ (1) 


Thus, the coordinates of a vector are equal to the difference 
of the respective coordinates of the terminus and origin of the 
vector. 

Using (1) and the condition for two vectors to be collinear 
we can derive a condition when three points of the plane, A(x, 
Y), B(x', y’), and C(x”, y"), lie on the same straight line. For 
the points A, B, and C to lie on n one straight line, it is necessary 
and sufficient that the vector AC be collinear to the vector AB, 
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Figure 29 


that is, the coordinates x” — x, y" — y of AC be > proportional 
to the respective coordinates x' — x, y' — y of AB. Thus, the 
condition for three points A, B, C to lie on one straight line 
is that the numbers x" — x and y" — y are proportional to 
x'—xy!-—. 

Example. Points A (1, 1), B(0, —3), C(3, 9) lie on one straight 
line since the numbers 3 — 1, 9 — 1 are proportional to 0 — 1, 
-3- 1. 

4. Cartesian coordinate system in space. 

Suppose we have a fixed point O (the origin) and a vector 
basis ej, ez, es in space. 

Definition. A Cartesian coordinate system in space is the set 
(O, €i, €, ex]. 

Three straight lines passing through O parallel to the cor- 
responding vectors ei, ey, and es are called the axes of coor- 
dinates and designated the x-, y-, and z-axes respectively. We 
shall always depict vectors ei, e», and es along the correspond- 
ing coordinate axes (Fig. 29). 

Definition. The coordinates of a point M in space relative 
to the Cartesian coordinate system (O, ei, €, es] are the coor- 
dinates of its radius vector OM in this system. 

In other words, the coordinates of the point M are three num- 
bers x, y, z such that 


OM = xe, + yez + ze 


Just as in the plane, x and y are the abscissa and ordinate 
of the point M respectively, the third coordinate z is the appli- 
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Figure 30 


cate of M. In compact notation, we speak of “the point M(x, 
Mo 

A Cartesian coordinate system in space yields a one-to-one 
correspondence between point M and ordered triples (x, y, Z) 
of real numbers. 

The coordinates of the origin O are 0, 0, 0. The coordinates 
of the points lying on the x-, y-, and z-axes are (x, 0, 0), (0, 
y, 0), and (0, 0, z) respectively, and the coordinates of the points 
belonging to the xOy, xOz, and yOz planes are (x, y, 0), (x, 
0, z), and (0, y, z) respectively. The coordinate planes partition 
space into eight compartments called octants. In the first oc- 
tant, x > 0, y > 0, z > 0, and it is shown in Fig. 30 by the 
straight lines which are parallel to the z-axis. 

Definition. The coordinates of a vector a in space relative 
to the Cartesian coordinate system (O, ei, e, e3} are the coor- 
dinates of this vector in the basis ei, e», es. 

As in the plane, we can find the coordinates of a vector from 
the coordinates of its origin and terminus: if A(x, y, z) and 
Bix’, y’, z^), then the coordinates X, Y, Z of the vector AB 
are X =x -x Y= y’ —y, Z =z’ — z respectively. 

Finally, the condition that the three points A(x, y, z), B(x’, 
y’, 2’), C(x", y”, z”) lie on one straight line in space is analo- 
gous to condition (2) derived for the plane: the numbers 
x” — x, y” — y, z” — z are proportional to x’ - x, y’ — y, 
245 

5. The coordinates of the point which divides a line segment 
in a given ratio. 


3— 366 
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Given two points in space, find the point C which divides 
the line segment AB in the ratio a:@. In analytic geometry, to 
find a point always means to find its coordinates. 

We use the following formula 


öc = Ê A + —2 gh 
a+ B a+ 
which was derived in Sec. 1.1., Item 7. We label the coordinates 
of point C with an asterisk and get 


B a , 
E : 
x peu c t 
B a , 
* ELS ER 
7 aa: nT » 9) 
z# =f i. E 


which is the solution of the problem. 

The same problem can be solved in the plane; then we have 
A(x, y) and B(x’, y’), and the solution is the first two formulas 
in (3). 

In a special case when o = f, formulas (3) assume the follow- 
ing form 

x*-leex) ywaloty), s-lgei) 
2 i 2 i 2 
that is, the coordinates of the midpoint of a line segment are 
equal to half-sums of the coordinates of the endpoints. 

Example 1. The line segment AB with endpoints A (7, 1) and 
B(4, —5) is divided into three equal parts. Find the coordinates 
of the points of division. 

© Suppose P is the division point closest to A. Then 
a:ß = 1:2 and the coordinates of P are 

2 1 2 
* = =S] — 9 = * —— ecu 
x 3 t 3 4-26, y 3 


For the second point of division Q we have a:8 = 2:1, conse- 


1 xz = 
1+3(-5)= -1 
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quently, 


xt = 17 " 24 Ed yrs 14 ie 24-5 223 
Thus, the division points are P(6, —1) and Q(5, —3). @ 
Example 2. Find the point which is symmetric to the point 
A(—2, 0, 7) relative to the point B(5, —1, 2). 

© If C is the required point, then B is the midpoint of the 
line segment AC. Therefore, denoting the coordinates of C by 
X, Y, Z, we get 


§=(-2+-x)/2, -1=(+y)/2, 2=(7 + zy2 
whence x = 12, y= -2, z= -3. @ 


Exercises to Chapter 1 


1.1. 
1. Find three arbitrary vectors a, b, and c in the plane and construct (a) 


3a, (b) ZL (o) 2a+ 3b, (d) TEES (e) a+ 2-6, 


1 
—-—-8-b-2. 
(f 5 


2. We have a parallelogram ABCD with AB = a and AD = b. Express the 
vectors PA, PB, PC, and PD in terms of a and b, where P is the point where 
the diagonals intersect. 

3. Two vectors a and b having the same length are laid off from a common 
origin. Prove that the vector a + b laid off from the same origin is directed 
along the bisector of the angle a, b. 

4. Two vectors a and b are laid off from a common origin. Prove that the 


vector a+ at b, laid off from the same origin, is directed along the bisector 
a 
of the angle a, b. 

5. Prove that for any two vectors a and b the inequality |a + b| < |a| + |b| 
is true. Find the condition when the equality sign is valid. 

6. Find the condition two vectors a and b should satisfy so that (a) 
|a + b| = |a — bj, (b) |a + b| > [a — bj, (c) |a + b| < ja — bl. 

7. Given a triangle ABC and a point O in the plane, a point P is constructed 


such that OP = Ho + OB + OC), prove that the sum (PA + PB + PC) 


is zero. Try to generalize the problem to an m-gon. 
_8. Prove that there are no two distinct points P and Q such that 
PA + PB + PC = 0 and QA + QB + QC=0. 


3* 
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9. The side BC of a triangle ABC is divided by points P and Q into three 
equal parts. Denoting AB = c and AC = b, express the vectors AP and AQ 
in terms of b and c. 

10. We mark a point D on the side AB of a triangle ABC so that CD is 
the bisector of z C. Express the vectors AD, BD, and CD in terms of CB — a 
and CA = b. 

11. Prove that the line segments joining the midpoints of opposite edges 
of a tetrahedron intersect, and the point of intersection is the midpoint of these 
line segments. 

12. Prove that a line segment joining the midpoints of the diagonals of a 
trapezoid is parallel to its bases. 

13. In a parallelogram ABCD the vertex B is connected to the midpoint Q 
of the side AD. Find the ratio in which the line segment BQ divides the di- 
agonal AC. 


1.2. 

1. In a regular hexagon 4A142434445A« we have A\A2 = a and A\ Ag =b. 
Express the vectors Á1A3, A144, AAs in the basis a, b. 

2. Find the values a and 6 such that the following vectors are collinear: 

(a) €-2, B), (a, —6.2), (b) (a, —2), (B, 28). 

3. Given two vectors, a = (2, 3) and b = (l, 2) in the plane, check that 
they are noncollinear and represent the vector c — (4, 9) in the basis a, b. 


1.3. 

1. Find the coordinates of the point which is symmetric to a point A(a, 
b) with respect to the origin of a Cartesian coordinate system. 

2. Given four points, A (l, 3), B(4, 7), C(2, 8), D( — 1, 4) in the plane, verify 
that the quadrilateral ABCD is a parallelogram. 

3. Given three points, A(—1, 2), B(0, 3), C(4, —5), find a fourth point D 
such that the quadrilateral is a parallelogram. 

4. Show that the four points A(3, —1, 2), B(1, 2, —1), C(-1, 1, -3), DG, 
— 5, 3) are the vertices of a trapezoid. 

5. Check whether the following points lie on one straight line: (a) A(3, 1), 
B(-2, —9), C(8, 11), (b) A(0, 2), B(-1, 5), CG, 4), (C) A(l, —5, 3), BG, 
—], 7), C(6, 0, 8). 

6. Find the coordinates of the midpoint of the line segment AB with end- 
points A(2, 3) and B(- 4, 7). 

7. Find the point which is symmetric to the point A(—3, 0) relative to the 
point B(2, 9). 

8. The line segment AB with endpoints A(l, —3, —5) and B(7, 3, 4) is divid- 
ed into three equal parts. Find the division points. 

9. A line segment AB is divided by the points P(0, 5, 2) and Q(l, 7, 2) 
into three equal parts. Find its endpoints. 

10. Find the point at which the medians of a triangle are concurrent (its 
centroid). The vertices of the triangle are at the points A(l, 4), B(—5, 0), 
C(-2, 1). 
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RECTANGULAR CARTESIAN COORDINATES. 
SIMPLE PROBLEMS IN ANALYTIC GEOMETRY 


2.1. PROJECTION OF A VECTOR ON AN AXIS 


Suppose that a straight line is specified in space. We choose 
the positive direction on it. 

Definition. An axis is a straight line having the positive direc- 
tion and a unit of measurement. 

The positive direction and the unit of length are usually 
specified simultaneously by a vector e having length 1 which 
is parallel to that straight line. This vector is called a unit vector 
or basis vector (Fig. 31). 

We denote the straight line by / and drop the perpendicular 
to it from an arbitrary point A. The point A‘, which is the 
foot of the perpendicular, is the orthogonal projection (or sim- 
ply projection) of A on the axis /. A 

Definition. Let / be an arbitrary axis and AB an arbitrary 
vector in space. The vector A’B’ whose initial and terminal 
points are the projections of the points A and B on the /-axis 
is called the vector projection of AB on the l axis (Fig. 32). 

Along with the projection of a vector there is also the scalar 
projection of a vector on an axis. E 

Definition. The scalar projection of a vector AB on the / 


Figure 31 Figure 32 
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axis is a number ^ such that 
A'B’ =de 


We denote a scalar projection by proj, AB, or sometimes 
proj aAB, where a is a nonzero vector directed along the posi- 
tive / axis. The absolute value of the number proj;AB is equal 
to the length of A’B’, where the sign indicates whether the 
vector has the same or opposite direction to the unit vector e 
of the / axis. 

Below we shall mostly use scalar projections, and so when 
we say "projection" we shall always mean a scalar projection, 
unless otherwise specified. 

l. Angle between vectors. Angle between a vector and an 
axis. 

Suppose that a and b are two arbitrary vectors. We place their 
initial points at the same point and draw a plane through them. 

The angle o between two vectors a and b is the angle of the 
shortest rotation from a to b in the plane of the vectors 
(Fig. 33). Obviously, 0 € o < v. If at least one of the vectors 
a, b is zero, the angle pm them is undefined. 

The angle between a vector a and the / axis is the angle ¢ 
between a and the unit vector e of the / a» axis (Fig. 34). 

We denote the angle ¢ by (a, a, b) or (a, a, I). 

2. Theorem on the projection of a vector on an axis. 

Theorem. The projection of a vector a on the | axis is equal 
to the product of the length of a and the cosine of the angle 
between a and the 1 axis: 


projza = |a| cos (a, 7) (1) 


Figure 33 Figure 34 
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proj, a >0 e proja=0 e proj, a« O 


Figure 35 


Formula (1) follows directly from the definition of cosine. 
Figure 35 shows some possible cases. 

3. Properties of projections. 

We shall derive the three most important properties of 
projections of vectors. 

19, Equal vectors have equal projections on the same axis. 

Indeed, if a = b, then |a| = |b| and (a, /) = (b, J), whence 
by formula (1) it follows that proja = proj:b. 

29. The projection of the sum of several vectors on an axis 
is equal to the sum of their projections on that axis: 


proji(a + b + ...) = proja + projb + ... 


To ascertain the validity of this property, it is sufficient to 
verify it for two summands. 

We construct a broken line OAB containing the directed line 
segments OA = a and AB-b (Fig. 36). We project the points 
O, A, B onto the / axis and get points O', A', B'. 
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Figure 36 
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Figure 37 


Suppose that ^ is the projection of the the vector a and ind y is the 
projection of b on the / axis. Then O’A’ = = e, A'B' = pe, 
where e is the unit vector of the / axis. Whence 
On = (^ + u)e, and this means that X + p is the projection 
of OB (Le. of the vector a + b) on the / axis. Thus, 


proj,(a + b) = proj; a + proj; b 


39. When multiplying a vector into a number, we multiply 
its projection by that number: 


proji(ka) = k proj; a 


o We translate the initial point of a to some point on the 
laxis. Let a’ be the vector projection of a on the / axis (Fig. 37). 
We have a’ = Xe, where ^ is the projection of a on the / axis. 

First we consider the case where k > 0. “Extension” of a k 
times obviously causes “extension” of a’ k times, that is, it be- 
comes a vector ka’. But ka’ = k(Xe) = (kA) e. Whence it fol- 
lows that proj(ka) = KX = k-proja. 

We suggest the reader consider the case where k < 0 indepen- 
dently. When k = 0 property 3° is obvious. m 


2.2. RECTANGULAR CARTESIAN COORDINATE SYSTEM 


A rectangular Cartesian coordinate system is the simplest of 
Cartesian coordinate systems. 

Definition. The Cartesian coordinate system (O, ei, e;] in 
the plane is called rectangular if e; and e? are perpendicular 
unit vectors. 
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M(x,y) 


Figure 38 


Similarly, we can define the rectangular Cartesian coordinate 
system (O, ei, ez, e3} in space; here ei, ez, and e3 should also 
be mutually perpendicular unit vectors. 

The basis unit vectors e; and e» of a rectangular Cartesian 
coordinate system in the plane are usually denoted by i, j, and 
the basis unit vectors ej, ez, and es in a rectangular Cartesian 
coordinate system in space are usually denoted by i, j, k. Ac- 
cordingly, an arbitrary radius vector OM can be represented in 
the basis as 


OM = xi + yj (in the plane) 
OM = xi + yj + zk (in space) 


The coordinates of the point M are x, y, in the first case, and 
X, y, Z in the second (Fig. 38). 

In what follows we shall say a rectangular coordinate system 
rather than a rectangular Cartesian coordinate system, and 
coordinates x, y in the plane and x, y, z in space we shall call 
rectangular coordinates. 

When considering rectangular coordinate systems we should 
note that there are right-handed and left-handed rectangular 
coordinate systems. 

In a left-handed rectangular coordinate system, the 90° rota- 
tion from the vector i to the vector j appears clockwise (Fig. 39), 
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i i 
right-handed system left-handed system 


Figure 39 


while in a right-handed coordinate system this rotation is coun- 
terclockwise. These two coordinate systems are different and 
no displacements in the plane can make the left-handed pair 
i, j coincide with the right-handed one. 

In a left-handed rectangular coordinate system in space, 
when three vectors i, j, k originate from a common point, the 
90° rotation from i to j when viewed from the terminus of k 
appears clockwise, while in a right-handed coordinate system 
such a rotation appears counterclockwise (Fig. 40). The two 
systems differ in the same way as left-hand and right-hand 
screws do. 

In this book we shall only use right-handed coordinate 
systems. 

1. Coordinates of a vector in a rectangular system,as its 
projections on coordinate axes. 

Here are some remarks. Suppose that in a rectangular system 
in the plane we have a point A with coordinates x and y. Then 
A1(x, 0) and A2(0, y) are its projections on the coordinate axes, 
which follows directly from our reasoning (see Sec. 1.2., 
Item 1). Similarly, if in a rectangular system in space the coor- 


right-handed system j left-handed system 


Figure 40 
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dinates of a point A are x, y, Z, then Ai(x, 0, 0), A2(0, y, 0), 
A3(0, 0, z) are its projections on the coordinate axes (see 
Sec. 1.2., Item 3). E 

Theorem. The coordinates of a vector AB in a rectangular 
coordinate system coincide with its projections on the coor- 
dinate axes. 

o Suppose we are given A(x, y, 2) and B(x’, y', z’). We 
have proved that the coordinates of AB in the given coordinate 
system are x’ —x, y' —y, z' — z. 

Projecting the points A and B onto the coordinate axes yields 
the points A1, A2, A3 and Bı, B2, B3. We have 


AB = OB, - OA: = x'i- xi= (x' - i 


and, similarly, A;B,- (í'-»i A3B3 = (z' — z) k. These 
equalities imply that 


proj.AB -x'-2 projy AB -y'-y proj,AB =z -Z m 


2. Length of a line segment in coordinates. 

Suppose that we have a coordinate axis with origin O and 
basis unit vector e. We. mark two points, A(xi) and B(x) 
(Fig. 41). Then AB = OB - OA = (x; — xı) e. Hence it fol- 
lows that the distance between A and B is 

|AB| = ba - xıl 

Let us now choose two points in a rectangular coordinate 
system, A(x1, yı) and B(x2, y2). We first assume that the line 
segment AB is not parallel to either coordinate axis. Through 


A and B we draw straight lines parallel to the coordinate axes 
(Fig. 42). By the Pythagorean theorem we have |AB|? = 


0 A(x) B(x;) 


Figure 41 Figure 42 
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Figure 43 


|AP|? + |BP|? from the right triangle ABP But |AP| = 
IprojAB| = pe — xıl, |BP| = |projyAB| = |y — yıl, whence it 
follows that 

IABI* = po - ail? + p = Wil? 
Thus, 


|AB| = V Go = x) + 0» - xy (1) 


Formula (1) is also valid when the line segment AB is parallel 
to either the x-axis or y-axis. 

Formula (1) expresses the length of a line segment in the 
plane via the coordinates of its endpoints. 

Finally, we consider points A and B in space. We have A(x, 
Ji, Zi), and B(x, y», z2). Assume first that the line segment 
AB is not parallel to the xy coordinate plane. We project A 
and B onto that plane and get the points A'(xi, yı, 0) and 
B' (x2, y2, 0). According to what we have proved, |A’ B’ |? = 
(x2 — x) + Or - Y. 

Through A and B we draw planes parallel to the z-axis 
(Fig. 43). By the Pythagorean theorem, from the triangle ABP 
we have 


|AB|? = |AP|? + |PB|’ = |A'B' |? + |PB|? 


But |PB| = |z2 — zi, since |PB| is the absolute value of the 
projection of AB on the z-axis. Whence 


|AB|? = (2 - x? + 02 - Y + (o - zy 
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Thus, 
|AB| = NQa — x)! + Or — Yi + (aa — ay (2) 


The last formula is also valid when the line segment AB is 
parallel to the xy plane (then |AB| = |A’ B'|, and z2 — zi = 0). 

Formula (2) expresses the length of a line segment in space 
via the coordinates of its endpoints. 

Example 1. Find the distance between the points A(—1, 1, 
5) and Bil, 3, 4). 

© From formula (2) we have 


lap) = V0 = (iy +0- 37 + 6-4 =3. © 


Example 2. Find the point in the plane equidistant from the 
three points, A(11, 3), B(10, 6), C(—1, 9). In other words, find 
the circumcentre of the triangle ABC. 

© Suppose that P(x, y) is the required point. We write the 
conditions |PA| = |PB| and |PA| = |PC| as follows: 

Væ- 11)? + Q — 3 = V@ — 10)? + & - 6)” 
Væ- 1D? +O- 3% = Vet 1D? +O - 97 


or after squaring and simplifying, as x —3y = -3, 
2x— y =4. Solving this system of two equations in two 
unknowns we find that x = 3, y = 2. The required point is P(3, 
2). @ 

3. Length of a vector in coordinates. 

In Sec. 1.3 we derived formulas for the coordinates of a vec- 
tor AB in terms of the coordinates of its initial and 
terminal points: 


X-2xí1—-x, Y-y-J) 
when A (a, yi) and B(x2, y2) are two points in the plane, and 
X-2x-x, Y-2y-y, 22-2 


when A (xi, yi, zi) and B(x2, y2, z2) are two points in space. 
Using formulas (1) and (2) we can now express the length of 
an arbitrary vector AB = a in terms of its coordinates in a rec- 


46 Part One. Analytic Geometry 


Figure 44 


tangular coordinate system: 
la| = VX? + Y? 
if a = CX, Y) is a vector in the plane, and 
jal =VX +Y +z 

if a = (X, Y, Z) is the vector in space. 

Example. The length of the vector a = ( —3, 4) in the plane 
is V(—3)? - 4 2 5. The length of the vector a= (1, 
—2, 2) in space is V1* + (—2)? + 2? = 3. 

4. Direction cosines. 

Suppose that a is a nonzero vector in space. We denote by 
a, B, y the angles between the vector and the x-, y-, and z-axes 
of the rectangular coordinate system (Fig. 44). Since the projec- 
tions of a vector on the coordinate axes coincide (according 


to what we have proved) with respective coordinates X, Y, Z 
of this vector, we can write 


X = |a| cosa, Y= la| cos 8, Z= la| cos y (3) 


Thus, each coordinate of the vector is equal to the product 
of its length and the cosine of the angle between the vector 
and the corresponding coordinate axis. 

Definition. The numbers cos o, cos 6, cos y are called the 
direction cosines of the vector a. 

The direction cosines are related as follows: 


cos? a + cos? 8 + cos? y = 1 (4) 


| 
| 
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Indeed, |a? = X? + Y? + Z?, whence according to (3) 
lal? = |a|?(cos? a + cos? 8 + cos? y) 


canceling out |a|? we arrive at equality (4). 

Example 1. Find the direction cosines of the vector AB if 
A(l, —1, 3) and BQ, 1, 1). " 

O The coordinates of AB are X=2-1=1, 
Y-1-(-)-22, Z-1-3--2. Whence, |AB|* = 

1? + 2? + (—2)? = 3. We now find the cosine vectors from 
formulas (3): 


Y. 2 
a x a 4.9 

Example 2. The vector a makes angles of 60? with the x- 
and y-axes. Find the angle between a and the z-axis. 

© Denoting the required angle by y we can write 
cos? 60° + cos? 60° + cos? y = 1, whence it follows that 
cos y = +1/V2. We have two solutions: y = 45° and y = 135°. 
Obviously, they correspond to two symmetric vectors relative 
to the xy coordinate plane. @ 

Example 3. A vector a makes equal acute angles with the 
coordinate axes. Find the angles. 

© In this case a = 6 = y, and therefore formula (4) gives 
3cos?a = 1. Whence we have cosa = +1/V3. Since, by 
hypothesis, the angle œ is acute, we take the positive value: 
cos a = 1/V3. From the table of cosines we find that 
a = 55°. @ 

We note in conclusion that if / is an axis in space and e is 
a basis unit vector on the axis, then the direction cosines of 
e are called the direction cosines of the axis itself. 


cosac S od. cos 8 = E 2 
al 3 al 


2.3. SCALAR PRODUCT OF VECTORS 


1. Definition of a scalar product. 

Consider the following problem. A constant force F exerted 
along the line of motion moves a particle of mass 1 from point 
P to point Q. Find the work done. We know from the physics 
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course that work is the product of the force and the distance 
moved and the cosine of the angle between the directions of 
the force and motion. In vector notation we can write 


A = IF||PO| cos (F, PQ) 


where A is the required work. 

There is a single operation which takes two vectors (or three 
numbers, viz. the lengths of the vectors and the cosine of the 
angle between them) to produce a scalar A. This operation is 
called the scalar multiplication of two vectors. We shall see be- 
low that this operation is widely used. 

Definition. The scalar (or dot) product of two nonzero vec- 
tors a and b is the number 


la||b| cos (a, b) 


If one of the vectors a or b is zero, their scalar product is zero. 
The scalar product of vectors a and b is denoted as a-b. Thus, 


a-b = [a|ib| cos e (1) 


where ¢ is the angle between a and b. The cases where a = 0 
or b = 0 can also be described by formula (1) since then |a| = 0 
or |b| = 0 (the angle e is undefined). 

The scalar product is closely related to the projections of vec- 
tors. If proj,b is the projection of vector b on the axis whose 
direction is the same as that of a, then, according to the the- 
orem on projections, we have proj,b = |b| cos o, proj,a = 
la| cos e, which together with formula (1) imply that 


a'b = lal proja b, a-b = |b| proj,a (2) 


that is, the scalar product of two vectors is equal to the product 
of the length of one of them multiplied by the projection of 
the other on the direction of the first vector. 

Note too that the scalar product of a vector with itself is 
the square of its length 


a:a = [a|[a| cos 0 = |a]? 


since the vector a forms the angle o = 0 with itself. 
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2. Properties of the scalar product. 
19. Commutativity: 


a-b=b-a 


This property follows directly from the definition of the sca- 
lar product. 
2°. Associativity relative to the multiplication of a vector by 
a scalar: 


(ka)-b = k(a-b) 
The property follows from the properties of projections since 
(ka)-b = |b| proj, (ka) = |b| kprojya = k|b| projpa = k(a-b) 
3°. Distributivity over vector addition: 
(a+ b)-c=a-c+ b:c 


We can prove this property using the properties of 
projections 


(a + b)-c = [e| proj,(a + b) = |e|(proj.a + proj-b) 
= |e|proj,a + |e| projcb = a-c + b-c 


4°. The property of sign. If a and b are two nonzero vectors 
and ¢ is the angle between them, then 

a-b>0 if the angle o is acute, 

a-b <0 if the angle ¢ is obtuse, 

a-b=0 if the angle ¢ is right. 

Indeed, since |a| > 0 and |b| > 0, we have 

the inequality a-b > 0 is equivalent to cos y > 0, 

the inequality a-b < 0 is equivalent to cos 9 < 0, 

the equality a-b = 0 is equivalent to cos 9 = 0. 

5°. The vectors a and b are perpendicular if and only if their 
scalar product is zero. 

Indeed, if a L b, then yg = 2/2, whence it follows that 
cos » = 0 and therefore a-b = 0. Conversely, if a-b = 0, then 
either jaj = 0 or |b| = 0 or cos e = 0. In the first and second 
cases a L b since the zero vector is perpendicular to any vector, 
in the third case v = 2/2 and a L b. 


4— 366 
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Here is an example of how to apply the properties of the 
scalar product. 

Example. PO is a principal diagonal of a parallelepiped 
(Fig. 45), a, b, and c are the edges emanating from the vertex 
P, and the angles between these edges are a (between b and 
c), B (between a and c), y (between a and b). Find the length 
of the diagonal PQ. > 

© We designate the vectors (edges) emanating from the ver- 
tex P as a, b, and c ([a| =a, |b| = b, (e| = c). Obviously, 
PQ =a + b + c. Whence it follows 


PO’ =(at+b+ec)(at+bt+c)=aat+a-b+a-c 
+bat+bb+bet+ca+eb+ec-c¢ 


Since a-a =a’, b-b = D?, c-c c, a-b = b-a = ab cos Y, 
a-c = c-a = ac cos B, b-c = cb = bc cos a, finally we get 


|PO|? = a? + b? + c? + 2ab cos y 
+ 2ac cos B + 2bc cos a. @ 


3. Scalar product and the test for a straight line to be perpen- 
dicular to a plane. 

We can use the properties of the scalar product for moving 
some geometric theorems. 


Figure 45 Figure 46 
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Theorem (the test for a straight line to be perpendicular to 
a plane). Zf a straight line l in space is perpendicular to any 
two intersecting straight lines of a plane a, then lis perpendicu- 
lar to any straight line in that plane. 

n We denote by a and b two intersecting straight lines of 
the plane œ which are perpendicular to a straight line /. Let 
c be a Straight line belonging to the plane and passing through 
the point of intersection of a and b (Fig. 46). We choose non- 
zero vectors l, a, b, and c on the straight lines /, a, b, and c 
respectively. By the hypothesis we have a-l = 0, b-l = 0. 

The vectors a and b are noncollinear (the straight lines a and 
b intersect), and therefore we can represent the vector c as their 
linear combination 


c= a + pb 
where ^ and » are some numbers. Whence 
c-l = (ha + ub)-1 = A(a-]) + p(b-1) = 0 


Consequently, the vector ¢ is perpendicular to the straight line 
l. Noting that c is any one of vectors of the plane o, we con- 
clude that any straight line of the plane a is perpendicular to 
l. uw 

4. Representation of the scalar product in terms of the coor- 
dinates of vectors. 

Suppose that the coordinates of vectors a and b are specified 
in a rectangular coordinate system in the plane: a = (X1, Yı), 
b = (X2, Y2). Then the following important formula is valid: 


a-b = Xi X5 + Yi: Y2 
that is, the scalar product is equal to the sum of the products 
of the corresponding coordinates. 


To derive this formula, we express these vectors in terms of 
the basis unit vectors: 


a- Xii * Yij b= Xi - Yj 
and then take the scalar product. We have 
ab = (XX) ii + QG Y) ij + NX ji + m Y2) jj 
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Since i-i = j-j = 1 (the length of i and j is unity) and 
i-j = j-i = 0 (iis perpendicular to j), the last expression reduces 
to the required form 


a:b = XiX + Vi ¥2 


If a and b are two vectors in space and a = (Xi, Yi, Zi), 
= (X5, Y», Z2), then similar reasoning leads to the following 
equality: 


ab = XiX + YiY; + ZZ 
which can be interpreted in the same way as in the plane. 
In particular, for one vector a = (X, Y, Z) we have 
aa = jal? = X? + Y2 + z 
whence follows the familiar formula 


la] = Vx? + Y  z 


5. Application of scalar products. 
The following formula is widely used in practice: 


a-b = XiX + YiYo + ZiZ; Q) 


(1) The perpendicularity test for two vectors. Two vectors 
a and b are perpendicular if and only if 


XiX + Yi + ZZ 50 


The proposition follows immediately from formula (3) and 
property 5? of the scalar product. 

(2) Angle between two vectors. Suppose that a and b are two 
nonzero vectors and ¢ is the angle between them. From the 
definition of the scalar product it follows that 

a-b 


cos = ——— 
? = ajib] 


or 
Xið + YiY) + ZiZ; 


COS p = eei eden eMe 
VX + Yi + ZiNXE Y) Zi 


(4) 
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(3) Projection of a vector on an axis. Suppose that we have 
an axis / in space with a unit vector e which makes angles a, 
B, y with the coordinate axes. Then the projection of an ar- 
bitrary vectors a = (X, Y, Z) on the / axis is 


proj.a = X cosa + Y cos8 + Z cos y (5) 


Indeed, since the length of e is unity, its projections on the 
coordinate axes are 1-cos a, 1-cos 6, 1-cos y. Consequently, 
the coordinates of e are also equal to cos a, cos f, cos y. Since 


projea = |e|projea = a-e 


we have formula (5). 

Example 1. Given the points A(2, 0, 1), B(2, 1, 0), C(1, 0, 
0). Find angle ABC. 

O We consider the vectors BA =Q-2,0-1,1-0)= 
(0 -1, 1) and BC = <1 - 2, 0 — 1, 0 — 0) = (-1, -1, 0). 
The required angle ABC is the angle « between these vectors. 
From the formula for the cosine of the angle between two vec- 
tors we have 


BA-BC _ 0-(-1) + (-1)-(-1) + 1-0 
|BA||BC} VO + (-1)? + 12V(—1)? + (-13 + 9? 


os Y= 


N| = 


Hence, o = 60°. @ 

Example 2. Find the projection of the vector a = (1, 2, 3) 
on the / axis which forms equal acute angles with the coordinate 
axes. 

© Example 3 in Sec. 2.2., Item 4 shows that the direction 
cosines of the / axis are cos œ = cos 8 = cos y = 1/V3. Conse- 
quently, 

proja = 1-7 + 2 * 3-7, = 2V3 @ 
Example 3. Given two vectors a = (1, 2, —1) and b = (2, 
—], 3). Find proj,b. 
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O Since a-b = |a|proj,b, we have 
ab _ 1-2 + 2-(-1) + (21-3 
la| VP +2? e (-1Y 


proj,b 


Il 
w 
m 
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2.4. POLAR COORDINATES 


Coordinate systems are used to locate points (in the plane 
or in space) by specifying their coordinates. Although the rec- 
tangular Cartesian coordinate system is most popular, other 
coordinates, eg. the polar system, are also widely used. The 
polar coordinate system in the plane is defined by fixed point 
O, called the pole (Fig. 47), a fixed ray p emanating from O 
and called the polar axis, and a scale for length measurement. 

The position of a point M different from the pole is defined 
in polar coordinates by two numbers, r and e. The number 
r > 0 is the first coordinate and called the polar radius; it is 
equal to the distance from the pole O to the point M. The num- 
ber e is the second coordinate and called the polar angle. We 
choose the positive direction of reckoning the angles; as is usual 
in trigonometry, the positive direction is counterclockwise. 
Then ¢ is the angle through which the ray p is rotated in the 
positive direction for p to coincide with the ray OM. If we ro- 
tate p clockwise, then e is a negative number. Obviously, the 
angle e lies within 0 and 27; in other words, the polar angle 
for point M is » of any of the angles o + 27n, where n is an 
integer. The pair of numbers (r, o) are called the polar coor- 
dinates of the point M. 


Pd 


9) P 


Figure 47 
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If M is not the pole, then r > 0 and ¢ lies within 277. Now, 
if M coincides with the pole, then r = 0 and ¢ does not matter. 
Conversely, given the pair of numbers (7, e) with r > 0, we can 
construct the point M for which r and e are polar coordinates. 


Exercises to Chapter 2 
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1. Find the projection of the unit vector e on the / axis with which e makes 
the following angles: (a) 30°, (b) 45°, (c) 120°, (d) 90°. 


2. Given proja = 1 jaj, find the angle between a and the / axis. 


3. Vectors a, b, and c form a triangle ie a+b+e=0. Find 
proj,(b + c). 

4. Prove that the sum of the projections of vectors a and b on an axis having 
the same direction as the vector a + b is equal to ja + bj. 


2.2 


1. Find the coordinates of points symmetric to the point A(— 3, 1) relative 
to the coordinate axes. 

2. Find the coordinates of points which are symmetric to the point A(—3, 
1, 2) relative to (a) the coordinate axes, (b) the xy, xz, yz coordinate planes. 

3. Find the lengths of the sides of triangle ABC if A(3, 2), B(-1, —-1), 
c(i, —6). 

4. Find the point on the y-axis 5 units from the point A(4, — 6). 

5. Given three points in the plane, A(3, 7), B(t, 3), and C(7, 5), find the 
point which is symmetric to A relative to the straight line BC. 

6. Find the center and radius of a circle passing through the point A(— 2, 
4) and touching the coordinate axes. 

7. Given two vectors a = (2, —5,3) and b = (4, —3, —3). Find the lengths 
of the vectors a + b and 2a — 3b. 

8. Given a triangle with the vertices A(3, —5), B(—3, 3), C(-1, —2), find 
the length of the median AD, where D is the midpoint of the side BC. 

9. Find the length and direction cosines of the vector AB with A(—2, 1, 
3) and B(0, —1, 2). 

10. Find out whether a vector in space can make the following angles with 
the coordinate axes: (a) 45°, 60°, 120°, (b) 45°, 135°, 60°. 1f the answer is affir- 
mative, find the coordinates of the vector assuming its length to be unity. 

11. Find the coordinates of the point M in space if the length of its radius 
vector equals 8 units and the angle of inclination to the x-axis is 45? and to 
the z-axis is 60°. 
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1. Check the validity of the following equalities: (a) (a + b)? = a? + 
2a-b + b°, (b) (a-b)(a-b)-a?—-b? (c) (a+b)? + (a-b? = 
2ial? + 2|bj?. 

2. Three vectors a, b, and c form a triangle, ie. a + b + c = 0. Express 
the length of c in terms of a and b. 

3. Two forces P and Q are applied to the same point. The forces act at an 
angle of 120° to each other and |P| = 4 and |Q| = 7. Find the value of the 
resulting force R. 

4. Prove that a triangle with vertices (a) A(0, 0), B(3, 1), C(1, 7) is right, 
(b) AQ, —1), B(4, 8), C(10, 6) is obtuse. 

5. Check whether the quadrilateral with vertices A(5, 2, 6), B(6, 4, 4), C(4, 
3, 2), D(3, 1, —4) is a square. 

6. Find the value of a such that the vectors ( — 2, o, 3) and <2, a, a) are 
perpendicular. 

7. Find the angle between two vectors a = 3p + 2q and b = p + 5q, where 
p and q are two perpendicular vectors. 

8. Find the angle between the bisectors of the coordinate angles xOy and 
yOz. 

9. Find the cosine of the angle between a diagonal and edge of the cube. 

10. A parallelogram is constructed on the given vectors a and b. Express 
the vector of its height which is perpendicular to the side a in terms of the 
vectors a and b. 

11. Find proj,b if a = (1, 2, —2) and b = (0, 5, 2). 

12. Given the points A(—3, 1, 5), B(3, 9, 5), CO, 0, 1), D(4, 1, —1), find 
the projection of the vector CD onto the axis having the same direction as the 
vector AB. 

13. Prove that the vector (a-e) b — (a:b) c is perpendicular to the vector a. 


2.4 


1. Construct the points whose polar coordinates are (a) (3, x/3), (b) ( 
2") . © (3, - 1/6). 


2. Find the polar coordinates of the point symmetric to the point (1, x/4) 
relative to (a) the pole, (b) the polar axis. 

3. Locate the points in the plane whose polar coordinates satisfy the follow- 
ing conditions: (a) r= 1, (b) r-3, () 1«r«3, (d) e- «A3, (e) 
7/2 < o < 37/2. 
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Chapter 3 


DETERMINANTS 


3.1. SECOND-ORDER DETERMINANTS. CRAMER’S RULE 


Suppose we have a square array consisting of four numbers: 


dı bi 
( b ) (1) 
a2 02 
Such an array is called a matrix or, to be more precise, a matrix 


of order (or dimension) 2 x 2 or a square of order two. 
Definition. The determinant of matrix (1) is the number 


a, bz = abı 
and is denoted by 
Q1 bi 
s (2) 
Thus 
aim | = aib — arbi 
a2 bz 


Example. We have 


]*-2 
3:55 


We should distinguish between matrix (1) and its determinant 
(2), namely, a matrix is an array of four numbers, while a deter- 
minant is the only number obtained from the array as indicated 
above. 

The numbers a1, b1, a2, b; are called the elements of matrix 
(1). The matrix has two rows and two columns. The elements 
41, b, and a5, b? comprise the first and the second row respec- 
tively, and the elements a1, az and bı, b; comprise the first and 
the second column respectively. 


| = 1(-5) - (-2)3 = 1 
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The same notions of elements, rows, and columns are also 
valid for determinant (2). We say that determinant (2) is com- 
posed of the elements ai, bi, a», b2. 

Although the expression a152 — a,b, for the determinant is 
not complicated, it is advisable to memorize the following 
scheme for evaluating it: 


EN by ay b, 
ag bo ag be 
NONU 


The line segment marked with the + sign connects two ele- 
ments whose product should be taken with the plus sign, and 
the line segment marked with the — sign connects two elements 
whose product should be taken with the minus sign. 

Determinants of second-order square matrices are called 
second-order determinants. 

1. Cramer's rule for solving a system of two second-order 
equations in two unknowns. 

Suppose we have a system of two equations 


aix + by =c 
Qux + by =o 


(3) 


We use the coefficients of the unknowns to form the deter- 
minant 

[^41 bi 
a2 b2 


We shall call it the determinant of system (3) and denote it by A 
We shall also need two other determinants, namely 
Ci bi 


c bz 


The determinant A; is obtained from A by replacing the ele- 
ments a1, a2 of the first column (they are the coefficients of 


Q1 C1 
a2 02 


Ay = , 2 — 
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x in the equations of the system) by the constant terms ci, C2. 
The determinant A» is formed in a similar way by replacing 
the elements of the second column in A by the constant terms. 
Theorem (Cramer's rule for a 2 x 2 system). 7f the deter- 
minant A of system (3) is nonzero, the system has a unique 
solution. The solution can be found from the formulas: 


x- = (4) 


u By multiplying both sides of the first equation in (4) by 
b; and of the second equation by — bı and adding the results, 
we obtain 

(a1b2 — abi) x = cibo — abi 


or, which is the same, A-x = A1. Whence we have 
= Ai 
A 


Similarly, by multiplying both sides of the first equation by 
— a and of the second equation by a; and adding the results 
we arrive at the equation 


(aib2 — abı) y = ac. — ci 
or, briefly, A-y = A». Whence we have 
TA 


Thus we see that if a system of equations is solvable, its solu- 
tion is given by (4). Now it remains for us to check whether 


A A ; s 
the numbers x = 5 , y= = constitute the solution of the sys- 


A 
tem, that is, whether the following equalities 
Al A» _ Ai A2 = 
arn Oe ae Gy + ba = 


hold true. We have 


aA, + D1A2 = ai(cibo — cobi) + bi (ac, — aac) 
ci(aib) — abı) = aid 
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and this proves the validity of the first equality. The second 
equality can be checked in a similar way. m 
Example. Solve the system 


3x — Sy = 
x-2y-21 
O We have 
3-5 
a= zie 
consequently 
[i = E i| 
1 -2 11 


The theorem we have just proved is called Cramer’s rule (to 
be more accurate, Cramer’s rule for a system of two equations 
in two unknowns). We shall discuss Cramer’s rule for the gener- 
al case (ie. for a system of n equations in n unknowns) in 
Sec. 3.15. 


32. THIRD-ORDER DETERMINANTS 


We shall now consider a square matrix of third order, that 
is, an array of 3 x 3 numbers, 


[^4] bi Cl 
a2 b; C2 (1) 
3 b3 C3 


The notions of element, row, and column are equivalent to 
those of matrices of second order. 
Definition. The determinant of matrix (1) is the number 


a1b5€3 + b1coa3 + €135D3 — ¢b2a3 — biazc3 — a1c2bs (2) 


Determinants of square matrices of third order are called 
third-order determinants. 


——— 
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The determinant of matrix (1) is written as 


ai bi €1 
a b © (3) 
a3 b3 C3 


Thus by definition we have 


[^4] bi €] 
a bı & | = dibacs + bicoas + c1a2b3 — ciboa3. (4) 
03 b3 C3 = b1a2€3 = a@102b3 


The numbers ai, bi, ci, a2, b2, c», a3, b3, c3 are called the 
elements of determinant (3). We say that determinant (3) is 
formed from elements ay, bi, ci, a2, b2, co, a3, b3, C3. 

Although (4) seems to be complicated, it can be easily 
remembered. If we connect with a dashed line each three ele- 
ments of the determinant whose product is taken with the plus 
sign in (4), we obtain scheme 1, which can be easily memorized. 
Similarly, for the products taken with the minus sign in (4) we 
have scheme 2. 


NY 2a MA p 
b /> CN NO 


A PN. Z NOX 
47V CN f v 
Scheme ] Scheme 2 


The schemes illustrate the triangle rule for evaluating third- 
order determinants. 
Example. Compute the third-order determinant 


12 3 
01 -1 
24 5 


62 Part One. Analytic Geometry 


O We draw the schemes 


and find that the determinant is equal to 1-1-5 + 2-(—1):2 + 
0-4-3 — 3-1-2 — 2-0-5 — (-1)-4-1 = -1. @ 

With practice the reader will be able to apply the schemes 
mentally without drawing them. 

Note that each of the six products in (2), say a155c3, is called 
a term of the determinant. Any term of the determinant con- 
tains either of the letters a, b, and c as a factor; in other words, 
each term contains one element of the first, second, and third 
columns. The indices of these letters in each term are permuta- 
tions of the numbers 1, 2, 3, that is, each term contains one 
element of the first, second, and third rows. This remark will 
serve as a basis for defining determinants of an arbitrary order 
n. 


3.3. nth-ORDER DETERMINANTS 


1. Notation. 

We shall now use one letter, say a, to denote the elements 
in a determinant with indices showing the position of the ele- 
ment in the determinant rather than use different letters (a, 
b, c, and so on) as we did above: namely, we denote by aj; an 
element in the ith row and jth column. For instance, 212 is the 
element of the first row and the second column (read “a one 
two" and not “a twelve"). 

In this notation, formulas for second- and third-order deter- 
minants are: 
Qi, 212 


= 411022 — d12021 
021 022 
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Qi1 di2 di3 
G21 022 d23 | = 411022033 + 412023031 + d13021032 
a31 032 833 — 013022031 — d12021033 — 411023832 


2. A permutation of the numbers 1, 2, .... n. Even and odd 
permutations. 

A permutation of the numbers 1, 2, ..., n is their arrange- 
ment in some order (not necessarily increasing). For instance, 
3, 2, 1, 4 is a permutation of the numbers 1, 2, 3, 4. 

The number of permutations of 1, 2, ..., n is the product 
1-2:3.;.n. 

n We use P, to denote the number of permutations of 1, 
2, ..., n. Obviously, P, = 2 = 1-2. We shall consider now a 
permutation of the numbers 1, 2, ..., n— 1: 


1151,55, Un Sal (1) 


We can use it to obtain the permutations of 1, 2, ..., n if we 
write the number n either on the left of i1, or between i; and 
iz, or between i and 73, and so on, or on the right of i,-1. 
In this way we can obtain n permutations of the numbers 1, 
2, ..., N from permutation (1). Since the number of permuta- 
tions (1) is P,-1, we have the following formula: 


P, = nPn-1 
Specifically, 
P; = 3-P, = 3-2-1, Py = 4-P3 = 4-3-2-1 
Ps = 5-Py = 5:4-3-2-1 
In general, P, = n:(n — 1)...3-2-1. m 
This product is written n! (read “n factorial"). Thus, the 
number of permutations of the numbers 1, 2, ..., n is n! 


Suppose we have a permutation Ji, 2, ..., Ja of the numbers 
1, 2, ..., n. We denote it by J and write 


J= i, Jz, sess Jn) 


An inversion in a permutation J exists when a larger number 
precedes a smaller number. 
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Example. The permutation (3, 2, 1, 4) has three inversions: 
G, 2), G, 1), @, D. 

We use o(J/) to denote the total number of inversions in the 
permutation J. The permutation is said to be even if the num- 
ber o(J/) is even, and odd if c(J) is odd. 

In the example just discussed the permutation has three in- 
versions and consequently it is odd. Note that the permutation 
(1, 2, 3, ..., n) has no inversions or, in other words, has zero 
inversions. Consequently, this permutation is even. 

3. Definition of nth-order determinants. 

We shall consider a square array formed from n x n num- 
bers. Such an array is called a square matrix of order n. The 
number from the ith row and jth column of the array is denoted 
by aij. 

We have the matrix 


d11 di2 Ain 
a21 022 ... Gin (2) 
anl an2 ann 


The numbers ai, ..., Gann forming the matrix are called its 
elements. 

Matrix (2) has n? elements. We choose n elements such that 
(1) they are in distinct rows (that is, we take one element from 
each row), and (2) they are in distinct columns (this means that 
we take one element from each column). We agree to call such 
a sequence of n elements of the matrix admissible. 

An example of an admissible sequence is &11, 422, -.., Ann. 
In order to obtain an admissible sequence we choose an ar- 
bitrary element of matrix (2) and then delete the row and 
column of the element. As a result we get a matrix of order 
n — 1. We choose an arbitrary element in this matrix and delete 
the row and column of this element. In the resulting matrix 
we again choose an element, and so on. All in all we choose 
n elements of matrix (2). These elements are from different 
rows and different columns of the matrix. 

Let us consider an admissible sequence of n elements of 
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matrix (2). We arrange the elements of the sequence in a 
definite order: beginning with an element a,j, from the first 
row, then an element a, from the second row, and so on. 
Thus we have 


jj, Mj» .... Ünj, (3) 


where Ji, j2, ..., ja are the numbers of the rows of the elements 
being chosen, and by definition they are distinct. Consequently, 
Jis J2, -.., Ja iS an ordered sequence of numbers 1, 2, ..., n; 
we denote it by J. Thus 


Ji J, ones Jn) (4) 


is a permutation of the numbers 1, 2, ..., n. 

Now if for each admissible sequence (2) we form the 
products a@jj,42j,...@nj, Of all its elements, multiply the 
products by either +1 or —1 depending on whether the permu- 
tation J is even or odd, and then add the results, we obtain 
the expression: 


$3 - D*P ay, aij. . -anj (5) 
J 
which is the determinant of matrix (2) or simply an nth-order 
determinant. The letter J below the summation sign indicates 
that summation is carried out over all possible permutations 
(4) of the numbers 1, 2, ..., n. The products d1j,25j,. . .a5j, in 
(5) are the terms of the determinant. There are n! terms in an 
nth-order determinant. 
An nth-order determinant is written as 


Qj a12 ... Qin 
a21 Q2» ... Grn 
ni Qn2 Ann 


The vertical lines mean that we are considering the determinant 
and not an array of n? numbers, that is, we are concerned with 
a number composed in a particular way from the array (expres- 
sion (5)). 


5—366 
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To summarize we formulate the following definition. 
Definition. The determinant of matrix (2) is the expression 


Qj di2 ... Gin 
a21 0822 ... Q2n| = L- 1) Paya. - Anj, (6) 


where the summation is carried out over all permutations 
J= (ji, Jo, ..., Ja) Of the numbers 1, 2, ..., n. 

Matrix (2) is often denoted by A and its determinant by |A| 
or det A*. Sometimes we shall use the notation A. 

4. Specific cases when n = 2 and n = 3. 

Before studying nth-order determinants, it is necessary to 
verify that for n = 2 and n = 3 formula (6) leads to the familiar 
second- and third-order determinants. Products of form 
Aijaz, are the terms of the second-order determinant 


11 412 


421 22 


where (jı, /2) is any permutation of the numbers 1, 2. There 
are two such permutations: (1, 2) and (2, 1), the first being an 
even permutation (it has zero inversions). Consequently, 

1 a» 


= 41422 — di2021 
42, 022 


which coincides with the known expression for a second-order 
determinant. 

The product a1),a2;,a3;, are the terms of the third-order de- 
terminant 


M1 di2 413 
021 022 023 
431 032 d33 


where (j1, /2, /3) is a permutation of the numbers 1, 2, 3. There 
are six such permutations, three of which are even, namely, (1, 
2, 3), (2, 3, D), (3, 1, 2), and three are odd, namely, (3, 2, 1), 
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(2, 1, 3), (1, 3, 2). Whence it follows that 


a1 412 413 
a21 022 423 | = 411822033 + 12423431 + 4134214832 
431 032 433 — 4134022031 — 412021833 — 411023432 (7) 


This is the rule we know for evaluating third-order deter- 
minants. 


3.4. TRANSPOSITION OF A DETERMINANT 


We shall now consider an interesting property of a deter- 
minant. 

Theorem. The value of a determinant is left unchanged if 
its rows are written as columns, in the same order. 

For instance, 


-1 -2 -3 -150 
5 7 9|2]|-274 
0 4 6 —-396 
a Suppose that 
411 di2 Gin 
A= |@1 ax Q2n 
Ani an2 Ann 
is the initial determinant, and 
411 412 Ain 
A* = |an ax Q2n 
Oni an2 Ann 


is the result of interchanging of its rows and columns. Let us 
prove that A = A*. 

We shall now introduce an additional concept, the inverse 
permutation. Let J be a permutation of the numbers 1, 2, ..., 
n. Let us form a new permutation, which is inverse to J, and 
denote it by J^ ! using the following rule. If a number j is in 


3* 
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the ith position in the permutation J, then in the permutation 
J^! the number i is in the jth position. 

For example, the permutation (3, 6, 4, 5, 1, 2) is the inverse 
of the permutation (5, 6, 1, 3, 4, 2). 

Let us show that o(J) = o(J~ 5), that is, the inverse permuta- 
tion has the same number of inversions as the initial permu- 
tation. 

Indeed, suppose that numbers k and /, which are in the ath 
and fth positions respectively in a permutatio J (with a < 8) 
form an inversion, ie. k > /. The numbers a and f are in the 
kth and /th positions respectively in the permutation J ^ '. Since 
I< k but 8 >a, the numbers a and @ form an inversion in 
J^. Thus, each pair of numbers forming inversion in the initial 
permutation is associated with a similar pair in the inverse per- 
mutation. We can prove by analogy that pairs not forming an 
inversion in the initial permutation correspond to similar pairs 
in the inverse. Whence it follows that the number of inversions 
in the permutation J coincides with the number of inversions 
in the permutation J^ !, ie. o(J) = o(J^ )). 

Let us prove that A = A*. The determinant A is the sum of 
n! products of the form 


(7 0*9 ajaj. . .dnj, (1) 


where o(J) is the number of inversions in the permutation 
Jr Jas etat Jn): 

Let us consider expression (1) in terms of the determinant 
A*. We interchange the factors in the product 41),@2;,. . .@nj, SO 
that the first element is an element from the first row of the 
determinant A* (ie. from the first column of A), the second 
element is an element from the second row of A*, and so on. 
The result will be 21/25. . -anj = @i1@i2-. -@i,n. The permuta- 
tion J = (i, b, ..., in) is the inverse of J = (Ji, jo, .... Jn) 
indeed if in the permutation J a number q is in the pth position, 
then expression (1) contains the factor ap;, and consequently 
pis in the qth position in the permutation J. Whence it follows 
that o(J) = o(J), and thus 


(21) Paja. . -anj = (21) 9 aida. . -ain 
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The left-hand side of this equality is a term of the deter- 
minant A with the sign it has in the determinant, and the right- 
hand side is the term of the determinant A* having the cor- 
responding sign. Thus the determinants A and A* have the same 
terms taken with the same signs. Whence it follows that 
A= A* m" 

The operation of interchanging the rows and columns of a 
determinant is called a transposition (A* is the transpose of 
A). Therefore, the theorem just proved can be formulated thus: 
the value of a determinant is left unchanged under transpo- 
sition. 

The property that a determinant is unchanged upon inter- 
changing rows and columns shows that in a certain sense rows 
are equivalent to columns, namely, any statement concerning 
a determinant formulated in terms of rows remains valid if the 
word “row” is replaced by the word “column”. We shall use 
this statement in Sec. 3.6 when deriving the principal properties 
of determinants of order 7. 


3.5. EXPANSION OF A DETERMINANT BY ROWS 
AND COLUMNS 


The initial formula 


Qj @i2 ... Gia 

421 022 ... Qran| = 3-0" aya. . -anj (1) 
cerit cuum MM: J 

Oni Qa2 ann 


is not suitable for evaluating nth order determinants since it 
has n! terms and as n increases this number grows rapidly (for 
instance, whereas a fourth-order determinant has 24 terms, a 
fifth-order determinant has 120 terms). 

In practice, determinants are usually evaluated from formu- 
las for the expansion of determinants by rows (columns), which 
are given below. 
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Let us consider an nth order determinant 


ail Q@2 Qin 
A= | 022 An 
ni an2 Ann 


Let i be a number from 1, 2, ..., n. Each term of the deter- 
minant contains one element of the ith row as a factor. We com- 
bine all terms involving a; (the first element of the ith row), 
put before the brackets the common factor aj;, and denote the 
expression in brackets by Ai. Then we combine all terms con- 
taining a (their sum is @;2Aj2), and so on. As a result the sum 
in (1) decomposes into n parts: 


ain, nAi, ..., AinAin 
Consequently 
A = ani + anin t... + QinAm (2) 


Equation (2) is the expansion of a determinant in terms of the 
elements of the ith row (or simply the expansion by the ith row). 
Expression Aj is called the cofactor or a signed minor of the 
element aj; in the determinant A. 

Thus, a determinant is equal to the sum of the products of 
elements of any row by their cofactors. 

As an example let us expand a third-order determinant by 
the second row. From formula (7) in Sec. 3.3 we have 


11 012 013 
a21 Q22 023 | = &21(413432 — 412033) + a22(a11033 
a31 032 033 — 413031) + a23(d12031 — 211432) 


The expressions in brackets are the cofactors A21, 4422, and A23. 

Remark. Any term of a determinant having an element aij 
as a factor cannot contain any other element from the ith row 
and the jth column of the determinant as a factor. Whence it 
follows that the expression for Aj; does not involve elements 
from the ith row and the jth column of the determinant. In 
other words, the number Ai; is completely defined by elements 
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from other (not the ith) rows and other (not the jth) columns 
of the determinant. 

The proof of the statement concerning columns is quite simi- 
lar. Let j be a number from 1, 2, ..., n. Each term of the deter- 
minant contains an element from the jth column as a factor. 
We combine all the terms involving a;; (the first element of the 
jth column). Their sum is equal to ai;Aij. Then we combine 
all terms containing a2; (their sum is equal to a2;A2,), and so 
on. The result is 


A = aiyjÁij + d2jÀ2j + ... + AnjAnj (3) 


which is the expansion of the determinant A in terms of the 
elements of the jth column (or simply the expansion by the 
jth column). 

Thus, a determinant is equal to the sum of the products of 
elements of any column by their cofactors. 

Formulas (2) and (3) can be used for evaluating the deter- 
minant A. But we must know how to find cofactors. To do this 
we first deduce some properties of rith-order determinants. 


3.6. PROPERTIES OF nth-ORDER DETERMINANTS 


Properties 17-7? of a determinant concern rows. 

1°. If all the elements of a row of a determinant are zero, 
the determinant is zero. 

In order to prove the statement, it is sufficient to expand a 
determinant by that row. 

29. If any two rows of a determinant are interchanged, the 
determinant is multiplied by —1. 

o We have to distinguish between two cases. 

Case 1. Interchanging neighboring rows. Suppose that we in- 
terchange two neighboring rows, say the first and second, in 
the determinant: 


a11 a12 ... Qin 
A = |]@21 d22 ... Gan 
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The result is the new determinant 


Gp 012 ... Gin 
A’ = |dz 422 ... Arm 
Oni an2 Ann 


Let us show that A’ — A. We consider some term of A: 


Qj, 2j, . -nj, (1) 


The factors 415, .. ., anj, Of this product are also the elements 
of A' and they belong to different rows and different columns. 
Consequently, any term of the determinant A is also a term 
of A'. The converse is also true: any term of the determinant 
A' is also a term of A. Thus, the determinants A and A' contain 
the same terms. In order to prove that A’ = — A, it is sufficient 
to verify that the sign (+ and —) of the term of the determinant 
A is opposite to the sign of the term of the determinant A'. 

The sign of the product in (1) of A is defined by the factor 
(- 0*9, where J = (ji, j2, ..., Ja). In order to find the sign 
of the same term of A', we should write the product in (1) so 
that the first be an element from the first row in A', the second 
be an element from the second row, and so on. We know how 
to obtain the determinant A' and so we write (1) in the form 
d2j,01j,03j,. ..dnj,. The sign of this product in A’ is defined by 
the factor (—1)"9), where J’ = (fo, jis Ja, «+> Ja). 

When passing from J to J’, we have interchanged jı and j2. 
As a result the number of inversions in a permutation should 
change (increase or decrease) by 1. Indeed, if jı < j2, then a 
change from J to J’ results in a new inversion (/;, jı), and if 
j2 € jı, then the change eliminated the inversion (j1, j2), other 
inversions being unchanged. Thus, the numbers o(J) and o(J") 
differ by unity, whence it follows that (—1)"? = —(—1)'9), 
This means that the sign of the product (1) in the determinant 
A is opposite in sign to that in A'. 

Case 2. Interchanging the rows which are not neighbors. Sup- 
pose that we interchange the ith and the jth rows, with i < j. 
This can be done in several steps. First we interchange the ith 
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and the (i + 1)th rows. Then we interchange the (+ 1)th and 
(i + 2)th rows in the resultant determinant, and so on until the 
ith row of the initial determinant is in the /th position. Then 
we shift the initial jth row “upward”. (it is now in the (j — 1)th 
position) until it is in the ith position. All in all, we should 
accomplish 2(j — i) — 1 interchanges of neighboring rows 
(j — i when shifting the ith row “downward” and j —i- 1 
when shifting the jth row upward"). Upon each interchange 
the determinant is multiplied by — 1; the total number of inter- 
changes is odd, and therefore the resulting determinant is multi- 
plied by —1. a 

Remark. The proof of property 2° shows that the initial de- 
terminant A and the determinant A’ obtained from A by inter- 
changing two rows consist of the same terms but taken with 
opposite signs. 

3°. If a determinant contains two identical rows, it is zero. 

B Indeed, if we interchange two identical rows in a deter- 
minant A, we should obtain the same determinant A; on the 
other hand under such an interchange the determinant must 
be multiplied by —1. Consequently, A = — A, whence 2A = 0 
or A=0. uw 

Example. The determinant 


=la 23 
4 -56 
4-56 


is zero since its second and third rows are identical. 

49. A common factor of the elements of any row can be taken 
outside the determinant. 

u For instance, let the elements of the first row have a com- 
mon factor k, namely kai, kai2, ..., Kain. Then 


Y hee M do à ioe wees | = Kay Ay + kazin +... + KainAin 
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ü11 12 Qin 
@21 022 ... Mn 

= k(ay Ait + anA +... + AtnAin) | (eene z 
Ani Qn2 Ann 


5°. If each element of a row is a sum of two addends, then 
the determinant can be represented as a sum of two deter- 
minants in each of which all the elements are the same as in 
the initial determinant except for the elements of the indicated 
row. The indicated row in the first determinant contains the 
first addends and that in the second determinant contains the 
second addends. 

For instance, 


bi + ca Diz + C12 ... Din + Cin 
a21 a22 eee Q2n 
Oni Qn2 Ann 
bii biz... Bin C11 C12 ... Cin 
d 422 Azn] + |G21 422 A2n 
Ani an2 Ann Oni Am Ann 


In order to prove this, it is sufficient to write 
(bii + c11) Aui + (bi2 + c12) A12 + ... + (Din + Cin) Ain 
= (by An + bAi + ... + binAin) 
+ (ciiAn + C12A12 + ... + CinAin) 
69. The value of a determinant is left unchanged, if the ele- 
ments of a row are altered by adding to them any constant mul- 


tiple of the corresponding elements of any other row. 
For instance, 


Q11 812 ... Qin 
821 022 ... Q2n 
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dii + kaz an + kazn ... Qin + Kaan 
21 22 sa 2n 
ani ün2 n Ann 


Indeed, according to property 5°, the determinant on the right- 
hand side of the equality can be represented as 


ü11 A12 ... Ain kazı kar kox Karn 

G2, 422 ... Gan| + 21 022 ... an 

Ani Q@n2 «++ Ann Ani an2 Ann 
Aii 812 ... Ain a21 A22 ... Qn 
d231 @22 ... Qn| + k | az 422 ... an 
Ani Qn2 Ann Ani Am Ann 
a1 012 ain 
421 022 On 
üni an2 Ann 


(the determinant containing the factor k is zero since it has two 
identical rows, the first and the second). 
Example. Evaluate the determinant 


2:59 59 
26 57 92 
263 571 920 


O Note that the second row differs "slightly" from the first 
multiplied by 10, and the third row from the second multiplied 
by 10. Therefore, we add the second row multiplied by —10 
to the third row, and the first row multiplied by —10 to the 
second row. The resulting determinant is 


259 
672 
310 
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which, according to property 6°, must be equal to the initial 
determinant. By a direct calculation (say, by the triangle rule), 
we find that the determinant is 109. Consequently, the initial 
determinant is also 109. @ 

A remark must be made before formulating the next 
property. We know that the sum of the elements of a row multi- 
plied by their cofactors equals the determinant. Suppose we 
add the elements of a row multiplied by the cofactors of the 
corresponding elements of another row. For instance, find the 
sum d21À11 + d22À12 + ... + @2nAin, Which contains the ele- 
ments of the second row multiplied by the cofactors of the cor- 
responding elements of the first row. The result can be deduced 
due to the next property. 

7°. The sum of the elements of a row in a determinant multi- 
plied by the cofactors of the corresponding elements of another 
row Is zero. 

o For instance, let us show that 


d31Ái1 + d22À12 + ... + anA = 0 (2) 


For arbitrary numbers bj, b», ..., b, the following equality 
is valid: 


bi b2 ... Da 
2, Q22 ... Gan) = bii + b2Ái2 +... + DnAin 
Ani an2 Ann 


(this is the expansion of the determinant by the first row). We 
assume that bı = a21, b; = @22, ..., bn = Gon and find that 


11 012 ... Gin 
0231 A22 ... Q2n| = dz1AÀ11 + d422À12 + ... + AmnÁin 
Oni an2 ann 


The determinant on the left-hand side is zero since it contains 
two identical rows, the first and the second row. Equality (2) 
is proved. m 


——— 


Ch. 3. Determinants 77 


Example. Given the determinant 


3 21 
1:572 
4 -33 


the sum of the elements of the first row multiplied by the cofac- 
tors of the corresponding elements of the third row is 


2:] 31 3 2 


-52 12 test. 


J: =. 


+r] 


3.7. MINORS. EVALUATION OF nth-ORDER DETERMINANTS 


1. Minors. 
Let us consider an nth-order determinant 


Q1 a12 ... Ain 
A= |i d22 ... Qn 
dni Qn2 Ann 


We c.i00se one element aj. If we delete the ith row and the 
jth column (i.e. the row and the column of the element a;j) from 
A, we obtain a determinant of order n — 1. This is called the 


„minor of the element aij in the determinant A and is denoted 


by My. For instance, the minor of the element a23 = —2 in the 
fourth-order determinant is 
1-1 7 1-1 7 


M»3-2|0 0 6|-2]|0 0.6|-20 
0 0-2 0 0-2 


2. Relation between Minors and Cofactors. 

Let us prove the following fundamental theorem. 

Theorem. 77e cofactor of any element ai; of the determinant 
A is equal to the minor of the element multiplied by (— 1) Y, 
that is, 


Aj = (-D'* My (1) 
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In other words, the same equality Ay = +M; is always true, 
the plus sign being valid when the sum i| + j is even and the 
minus sign when it is odd. 

n First we prove formula (1) for the special case when i = n, 
Jj = n. Since the number n + n is even, we need only verify that 


Ann = Man (2) 


According to the definition of a cofactor, in order to find 
Ann We must combine all the terms of the determinant A which 
involve the element ann as a factor, that is, the terms of the form 


dj 2j. . Un — 1, h-inn (3) 


The sign of such a term in A is defined by the factor (— 1)? , 
where J = (J1,J2, .. ., Jn - 1, n), Eliminating the number n from 
the permutation J, we obtain a permutation J’ = (ji, jz, ..., 
Jn-1) Of the numbers 1, 2, ..., n — 1. The number n is the 
largest of the numbers 1, 2, ..., n — 1, and therefore it does 
not form an inversion with either of the numbers ji, j2, ..., 
Jn—1 in the permutation J; consequently, the number of inver- 
sions in J coincides with that in J', that is, 


o(J) = o(J^) (4) 


If we factor out 2,4 from all terms of form (3) (each taken 
with the respective factor (—1)°™) the expression in brackets 
will be equal to Ann. Noting (4), we thus have 


Ann = 2,( 1)" ajj.. 0n 1,5 
F 


where J’ is any permutation of the numbers 1, 2, ..., n — 1. 
The sum on the right-hand side is the determinant 


aa a2 01,n-1 
21 022 02,n-1 
On-1,1 Qn- 1,2... Q@n-1,n-1 


that is, the minor M,,. Equality (2) is proved. 
Let us now consider the general case when | and j are any 
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integers from 1 to n. We reduce this to the special case of i = n, 
J = n discussed above. 

We interchange the ith and (i + 1)th rows in the determinant 
A. Then we interchange the (i + 1)th and (i + 2)th rows in the 
resulting determinant, and so on, until the ith row of A changes 
places with the nth last row. The element aj; in the resulting 
determinant appears in the nth row and jth column. Then by 
interchanging neighboring columns (first the jth and the 
( + I)th, then the ( + Ith and (j + 2)th columns, and so on), 
we shift the element aj; “to the right" until it appears in the 
bottom right corner of the determinant, i.e. in the mth row and 
nth column. We have made 2n — i — j interchanges of neigh- 
boring rows and columns (n — i when shifting the ith row 
“downward” and n — j when shifting the jth column “to the 
right"). These interchanges leave the determinant unchanged, 
while changing its sign. Given that the number 2n — (i + j) is 
even or odd as i + j is even or odd, we infer that,as a result 
of the interchanges, the terms in the determinant A are multi- 
plied by (—1)/*7. 

The minor M,, in the resulting determinant A’ is the same 
determinant of order n — 1, as the minor Man. Thus, the cofac- 
tor Aj; is equal to (—1)'*7Aj. Given that Aj, = Man (from 
what we have just proved), we have 

(-D'"Ay = My 
and this is essentially the same as (1). m 
3. Evaluation of nth-order determinants. 
In practice, nth-order determinants are evaluated from the 


formulas for expanding a determinant by a row or column. For 
instance, expanding a determinant A by the ith row we have 


A = aiAü + dnÁi + ... + AinAin 


or noting formula (1) 
A = an(-1)'*!Mi  ap( - D* Ma + ... + ain(—1)'* "Min 


This equality reduces the evaluation of the rth-order deter- 
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minant A to the evaluation of a set of (n — 1)th-order deter- 
minants Mii, Mi», ..., Min. 

The formula for the expansion of a determinant by a row 
(or column) is simpler when all the elements of this row (or 
column) are zero except for one element, say aij: 


A= dijÁi 
or 
A = ay(—1)'*/My 


Thus to evaluate the nth-order determinant A means to evaluate 
a single (n — 1)th-order determinant. 

Although a given determinant A may not contain a row (or 
column) with the required number of zeros, we can always 
transform the determinant, leaving its value unchanged, so that 
the elements of the row (or column) we have chosen are zero 
except for one element. To do this we use one of the properties 
of a determinant, namely, the value of a determinant remains 
unaltered if the elements of one row (or column) are altered 
by adding a multiple of the corresponding elements of any 
other row (or column), or in short, if a row (or column) multi- 
plied by an arbitrary number is added to another row (or 
column). A concrete example illustrates how a determinant can 
be transformed to the desired form. 

Example. Evaluate the fifth-order determinant 


1 3 
0 =2 
A E 
1 -1 
222 


O We choose the second row since it contains more zeros 
(there are two). We shall try to transform the determinant 
without changing its value, so that all the elements of the sec- 
ond row are zero, except for d24 = 1. Obviously, it is sufficient 
to add the fourth row multiplied by 2 to the second row and 
the fourth row multiplied by —2 to the third row. The result 
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is the following determinant 


1-12 -21 
0 00 10 
-1 03 -13 
1-13 01 
221 2 


which equals the initial one. We expand it by the second row 
and obtain 


[13 
-1 033 
1 -131 
2 271 


We should choose the second row in the last determinant since 
it contains zero. Let us transform the determinant so that all 
the elements of the second row are zero, except for az = —1. 
By subtracting the first row from the third one and the first 
row multiplied by 2 from the fourth one we obtain the deter- 
minant 


le i 
-1 03 3 
0 01 0 
0 03-1 


which equals the initial determinant A. Its expansion by the 
second row yields 


-13 3 -13 3 
A=(-1X-1)!*?]| 01 o|2]| 01 0 
03 -1 03 -1 


This third-order determinant can be directly evaluated, but to 
expand it by the first row is much simpler: 
A-(-0(C ptt d 


(U x 
ELI, 


6—366 
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3.8. CRAMER’S RULE FOR AN 7 x n SYSTEM 


An important application of rith-order determinants is solv- 
ing a system of n linear equations in n unknowns. We shall 
call such a system an n X n system. 

Let us introduce some notation. We shall use the same letter 
x with different indices to denote different unknowns, viz. xi 
is the first unknown, x2 is the second one, and so on. The 
coefficients of the unknowns in an ith equation are: qj is the 
coefficient of x1, aj; is that of x2, and so on. So the ith equation 
of the system is written as 


aiX| + diaX2 + ... + GinXn = bi 


where n is the number of unknowns in the system. 
We shall write an n X n system as 


auxi + diaX2 + ... + AinXn = Dy 
231X1 + d22X2 + ... + GanXn = bz (1) 
Gn1X1 T ün2X2 ae oF AnnXn = bn 


The determinant 


@i1 di2 ... Gin 
A = |a 42 ... Gan (2) 
an1 an2 Ann 


formed from the coefficients of the unknowns in system (1) is 
called the determinant of this system. 

A solution of system (1) is the set of values of the unknowns 
X1 = Q1, X2 = 02, ..., Xn = a, Such that it satisfies every equa- 
tion of the system. 

The following theorem, which is called Cramer’s rule for an 
n X n system, is valid and it is similar to the theorem we have 
proved in Sec. 3.1 (Cramer's rule for a 2 x 2 system). 

Theorem (Cramer's rule for an n x n system). If the deter- 
minant A of an n X n system is nonzero, then the system has 
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a unique solution, which can be found from the formulas 


A A A 
maz) =e mae (3) 
where each determinant Ai, A», ..., An is obtained Jrom the 
determinant A by interchanging the corresponding column (the 
first, second, .. ., nth) by the column consisting of the constant 
terms of the equations: 


bi az Gin air by ain 
Ar = | b2 ax an|, Az = |dn bz Gn |, 
Ds an2 ann ani bn ann 


o First we prove that if a solution x1, X», ..., Xn exists for 
system (1), then it is satisfied by formulas (3). Then we show 
that the values of the unknowns defined by (3) are valid for (1). 

(D) Suppose that each of the equalities in (1) is satisfied by 
some numbers Xj, X», ..., Xn. We multiply both sides of the 
first equality by A11, those of the second by A21, and so on, 
and both sides of the nth equality by Anı. In other words, we 
multiply the equalities in (1) by the cofactors of the first column 
of the determinant A. Summing these equations yields 


Anu(anxi + dix) +... + QinXn) 
+ Áa(G2)1X3i + d22X2 + ... + G2nXn) 
+ An (Qn1X1 + Gn2X2 eee + GnnXn) 


= bian + D2421 + ... + brAÁnı 
By combining the terms containing first xı, then xz, and so 
on, we rewrite the equation in the form 
(anA + anA +... + GQntAnt) xi 
+ (aizAn + @22A21 + ... + Gn2Ant) X2 


+ (dini + QnmnÁn +... + GnnÁ ni) Xn 
= byÁAn + Db24531 +... + bn Am (4) 


6* 
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The coefficients of xi, X2, ..., Xn can be easily simplified using 
the properties of a determinant. For example, the coefficients 
of xı is the sum of the elements of the first column in A by 
their cofactors; consequently, this coefficient is A. The coeffi- 
cient of x is the sum of the products of the elements of the 
second column in A by the cofactors of the corresponding ele- 
ments of the first column; consequently, this coefficient is zero 
(property 7° of determinants). Similarly, the coefficients of x3, 
..., Xn are zero. As a result, the left-hand side of (4) is equal 
to A-x,. The right-hand side of (4) is the expansion of A; by 
the first row; consequently, the right-hand side of (4) is equal 
to A,. Thus, it follows from (4) that 


Ax = Ai 
and noting that A # 0 we have 


Mcr. 


We have obtained the first formula in (3). 

In order to obtain the second formula from (3) we multiply 
both sides of the first equality in (1) by A12, those of the second 
by A22, and so on, and those of the last by Anz. In other words, 
we multiply the equalities in (1) by the cofactors of the elements 
of the second row in the determinant A. Adding these equalities 
yields 


A:X2 = A2 


ie the second formula in (3). 

The remaining formulas of (3) can be obtained in a similar 
way. 

(II) Let us now check whether the values of the unknowns 
defined by (3) satisfy (1). We verify this for the first equation. 
We have 

Ai An 


a pete Se 
eK A eee In 
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[ad1(b1A11 + b2421  ... + brAnı) + a12(b1A)2 


a 
+ DB2A22 + ... + BnÁn2) + ... + Qin( Ain + D2A25 
+... Dana) 

1 


A lain + dg... + QinAin) + b2(a11421 


+ a2A22 + ... + GinAan) + ... + ba(QiiAn 
+ @i2An2 + ... + QinAnn)). 


The factor of bı on the right-hand side of this equality is equal 
to A since it is the expansion of A by the first row; the factor 
of b; is zero since it is the sum of the products of the elements 
of the first row in A by the cofactors of the corresponding ele- 
ments of the second row; similarly, the factors of the remaining 
b; are also zero. Consequently, we have 


1 i 
iA = L| 


Remark. Formulas (3) are seldom used since they require 
evaluating the determinants A, Ai, ..., An. A practical method 
for the solution of system (1) is Gaussian elimination (see 
Chapter 9). 

Example 1. Solve the 3 x 3 system 


Xtytz-2 
4x -2y c z 2 —4 (5) 


9x -3y -z- -8 


We have 
111 
AS|421|22z0 
931 
and therefore the system has a unique solution. We find that 
—211 1-21 
A= | -421|=-2, =| 4-41) =2 


-831 9 -81 
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11 -2 

A3= |42 -4] = -4 
93 -8 


Consequently, x = -2/22 -1, y=2/2=1, z= -4/2= 
-2. 

Example 2. Given three points in the plane: Mi(1, —2), M2, 
—4), M3(3, — 8), find the coefficients p, q, r such that the graph 
of the function y = px” + qx + r passes through Mi, M^», and 
Ms. In other words, draw a parabola through these points. 

© The condition for the parabola to pass through Mi(1, 
—2) is 


-2=p-Pigqlt+r 


The similar condition for M2(2, —4) is 


—4=p24q2+r 


and for M3(3, —8) is 
-82p34q3-sr 


We have a system of three equations in three unknowns 


pt+qtr=-2 
4p+2qt+r=—-4 
9p + 3q +r= -8 


which only differs from system (5) in Example 1 by the notation 
for the unknowns. Whence we have p = —1, q = 1, r= —2, 
and the desired function is y = —x? + x + 2. @ 


3.9. A HOMOGENEOUS n x n SYSTEM 


Definition. A linear equation is said to be homogeneous if 
its constant term is zero. A system composed of homogeneous 
equations is homogeneous. 

Let us write a homogeneous system of n equations in n 
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unknowns in the general form 


diiXi + di2Xo 4o... + AinXn = 0 
@21X1 + d22X2 + ... + G2nXn = O (1) 
GniXi + Gn2X2 + ... + AnnXn = 0 


Setting xy = 0, x2. = 0, ..., Xn = 0 we obtain one of the solu- 
tions of a homogeneous system. This solution is called a zero 
(or trivial) solution. It is frequenly necessary to know whether 
a homogeneous system has nontrivial solutions, ie. such that 
at least one of the numbers x1, x2, ..., Xn is nonzero. The fol- 
lowing theorem gives an answer. 

Theorem. A system of n homogeneous equations in n 
unknowns has a nontrivial solution if and only if the deter- 
minant is zero. 

n As usual we use A to denote the determinant of a system 


a11 412 ... Gin 
A= 021 822 ... Qn 
Ani Am ... Ann 


We must prove two statements. 

(1) If system (1) has nontrivial solution, then A = 0. 

(2) If A = 0, then system (1) has a nontrivial solution. 

Let us prove the first statement. Suppose that system (1) has 
a nontrivial solution œ1, o2, ..., Œn. Then the system has at 
least two distinct solutions: (o1, a2, ..., o») and (0, 0, ..., 
0). Whence it follows that A = 0. Indeed, if A # 0, then as we 
know the system has a unique solution. 

Let us now prove the second statement. We use induction 
on n. For n = 2 we have the system 


@11X1 + di2X2 = 0 
21X1 + d22X2 = 0 


whose determinant, by the hypothesis, is zero: 


811 d12 
= 411422 — di2021 = 0 


@21 G22 
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This equation implies that any pair of numbers, (a2, —411) 
Or (422, —421), is a solution of the system (check it). Therefore, 
if there are zeros among the numbers aii, 412, 021, 022, then 
a nontrivial solution exists. Now, if all four numbers are zero, 
then any pair of numbers x1, x2 gives the solution; consequent- 
ly, nontrivial solutions exist in this case too. 

Let us now suppose that the second statement is valid for 
a system of n — 1 equations; we prove the statement for a sys- 
tem of n equations. 

We shall not consider a system of the form 


O-x1 + 0x2 +... + O-x, = 0 
Ox + 0x2 +... + OX, = 0 


since any set of the numbers xi, X2, . . ., Xn provides its solution. 
Consequently, we may assume that at least one of the coeffi- 
cients of the unknowns in (1) is nonzero. Two cases are possible 
here. 

Case 1 when ai; # 0. We add the corresponding sides of the 
first equation multiplied by —42:/a11 to both sides of the sec- 
ond equation in (1). We obtain 


Q21X1 + d22X2 + ... + AanXn 


021 
m (axı + d12X2 + ... + AinXn) = 0 
11 


ie. an equation of the form 


0j3X2 + ... + Apr = 0 


(the term containing xı is absent). Similarly, adding the cor- 
responding sides of the first equation multiplied by —a31/41ı 
to both sides of the third equation in (1) yields 


a32X2 + ... + d44X. = O 
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and so on. As a result system (1) assumes the form 


Q11X1 + di2Xo + ... + AinXn = O 
dj) + 2... + afnXn = 0 (^) 


Q2X2 o... + dX. = 0 


System (1 ^) is equivalent to system (1), that is, it has the same 
solution as (1). Indeed, since (1’) is derived from (1), any solu- 
tion xi, X2, ..., Xn Of (1) satisfies system (1^). The converse 
is also true, that is, system (1) can be derived from (1^) (to do 
this we multiply the first equation in (1^) by @21/a1; and add 
the result to the second equation, then multiply the first equa- 
tion by a31/21; and add the result to the third equation, and 
so on), hence it follows that every solution of system (1^) satis- 


fies (1). 
Let us consider the determinant of system (1^) 
Ait di)... Ain 
A'-| 0 ay jn 
0 an ann 


Obviously, this can be obtained from the determinant A in the 
same way as system (1^) was obtained from system (1), namely, 
we multiply the first row by —a25:/ai; and add the result to 
the second row, then multiply the first row by —a3;/a1; and 
add the result to the third row, and so on. Each of these opera- 
tions leaves the determinant unchanged (property 6" of a deter- 
minant). Consequently, A’ = A. But by the hypothesis A = 0 
and, consequently, A’ = 0. 
Expansion of A’ by the first row yields 
dj... Aan 
73053 CT EE ET TS T =0 


ax oes Mn 
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According to the inductive assumption, this means that an 
(n — 1) x (n — 1) system 


Qu2X2 t... + AnnXn = 0 


which is a part of system (1^), has a nontrivial soluton. Let 

the solution be x2 = o2, ..., Xn = On. Having found x; from 

the first equation in (1 ’) (a 2 UB Q2—...— Sir 2 ,we 
aiu au 

get a complete solution o, a2, ..., o of system (1^) and there- 

fore of system (1). 

Case 2 when aj; = 0. By interchanging, if necessary, the 
equations in system (1) (as a result the rows in the determinant 
will interchange and it becomes zero), we get the first equation 
with at least one of the coefficients of the unknowns to be zero. 
If it is the coefficient of x1, we have Case 1. If it is the coeffi- 
cient of, say, xz, we must interchange the unknowns, namely, 
substitute x; for xı and xı for xz (the first and second rows 
interchange in the determinant and it remains Zero). As a result, 
the coefficient of x in the first equation becomes nonzero, i.e. 
we again arrive at Case 1. m 

Example 1. Find whether the system has a nontrivial so- 
lution: 

x-3y+2z=0 
2x- y+ z=0 
8x* y- z=0 


O We find the determinant of the system 


iş 2 
A=|2 -1 1 aa 1 
8 1 -1 EE 
2 1 2 -1 
-(-3 E dE. 


Since A z 0, the system has a unique solution, x = 0, y = 0, 
z= 0. There is no nontrivial solution. 6 
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Example 2. Find whether the system has a nontrivial so- 
lution: 


(check this). Thus the system has a nontrivial solution. @ 
Example 3. Find the value of a at which the homogeneous 
system 
ax+ y+5z=0 
7x —- ay * z=0 
—3x *2y - 3220 


has a nontrivial solution. 
O The determinant of the system 


a 15 
A= 7 —a 1| = —3a* — 17a + 46 
-3 23 


depends on a. A nontrivial solution exists if and only if A = 0, 
i.e. if 3a? + 17a — 46 = 0. By solving this quadratic equation, 
we find its two roots, a, = 2 and a; = —23/3. The system has 
nontrivial solutions at these values of a only. 6 


3.10. A CONDITION FOR A DETERMINANT TO BE ZERO 
We first consider a second-order determinant 


a1 i2 
2| 022 


A= 


Each row of such a determinant is a sequence of two numbers. 
In a Cartesian coordinate system in the plane, each row is as- 
sociated with a vector. 
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Similarly, each row of a third-order determinant 


@i1 d12 413 
Q = | anı ax az 
31 G32 033 


is a sequence of three numbers. In a Cartesian coordinate sys- 
tem in space, each row is again associated with a vector. 

Theorem 1. A second-order determinant is zero if and only 
if its row vectors are collinear. 

Theorem 2. A third-order determinant is zero if and only if 
its row vectors are coplanar. 

Let us first prove the following lemma. 

Lemma. Two vectors pi and p» are collinear if and only if 
one of them is a linear combination of the other. Three vectors 
Pı, P2, and p; are coplanar if and only if one of them is a 
linear combination of the other two. 

n The first assertion is obvious since it means that p; and 
p2 are collinear if and only if the equality p; = Ap; or p2 = Api 
holds true, where ^ is a number. 

Let us prove the second assertion of the lemma. We first as- 
sume that the vectors pı, pz, and ps are coplanar and reduce 
them to the origin. If p; = 0, then pi is a linear combination 
of p; and ps: pi = O-p2 + O-ps. Therefore we assume that 
pi z 0. Two cases are possible here. 

(1) Vector pz is collinear with pı, ie. pz = Api, à being a sca- 
lar. We write this equation in the form p; = Ap; + O-p3 and 
see that one of the three vectors can be represented as a linear 
combination of the other two. 

(2) Vector p; is not collinear with pı. Then pı and pz 
represent a plane, and vector ps, which is coplanar with p, and 
p2 should lie in this plane, and this means that ps can be 
represented as a linear combination of p; and pz (see Item 1 
in Sec. 1.2). 

Conversely, suppose one of the vectors pi, pz, ps is a linear 
combination of the other two, namely ps = ^pi + up». It is ob- 
vious that pı, pz, ps are coplanar. m 

D Let us now prove Theorem 1. We use p; and p» to denote 
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vectors corresponding to the rows of the determinant (pı cor- 
responds to the first row and pz to the second). We must prove 
two statements: 

(1) if vectors p; and p; are collinear, then A = 0, and 

(2) if A = 0, then p; and p» are collinear. 

Let us prove the first statement. Suppose that pı and p; are 
collinear, then, according to the lemma, one of them is a linear 
combination of the other, namely, p? = Api, \ being a number. 
This means that a21 = Mni, @22 = M12. It follows that 
ai a 


= diidz) — di2d21 = (411412 — 42411) = 0 
2| 422 


the proof is complete. 
Let us prove the second statement. Suppose A = 0. 


Let us consider the following system of equations 


(1) 


àixi + d21X2 = 0 
aiixi + d22X2 = O 


Its determinant 


11 21 
12 022 


a 


is obtained by transposing A. Consequently, Æ is also zero, and 
this means, according to the theorem on homogeneous systems, 
that system (1) has nontrivial solutions, say x1 = a1, X? = a2. 
Thus, the following equations are valid: 


aia + 42102 = 0, @i201 + 42202 = 0 


where either o; or a2 is nonzero. These equations show that 
if we multiply the first row of the determinant by o; and the 
second by o? and add the results, we obtain the zeroth row. 
In other words, 


œp: + o2p2 = 0 (2) 


Since either a or o; is nonzero, we can represent one of the 
vectors pi, pz as a linear combination of the other. For instance, 
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if œ ¥ 0, then it follows from (2) that 


Thus, p; and pz are collinear. m 
o Let us prove Theorem 2. We use pi, p2, and p; to denote 
the vectors corresponding to the rows of the determinant Q. 
As before, we must prove two assertions: 
(1) if vectors pi, pz, and ps are coplanar, then Q = 0, and 
(2) if Q = 0, then vectors pi, p2, and p; are coplanar. 
Suppose pi, p2, and ps are coplanar, then, according to the 
lemma, one of the vectors is a linear combination of the other 
two. For instance, ps can be represented as a linear combination 
of p; and pz, ie. ps = Api + up, where ^ and y are numbers. 
We write this equation as p3 — Ap: — up? = 0. Hence we can 
see that if we multiply the first row of the determinant Q by 
— and the second row by — p and add them to the third row, 
then we obtain the determinant 
011 d12 13 
Q’ = |a5 a2 023 
0 0 0 


Using the properties of a determinant, we can write Q’ = Q 
(property 6°) and Q’ = 0 (property 1°). Consequently, Q = 0, 
which was required to prove. 

The second statement can be proved in the same way as the 
similar statement of Theorem 1 (we leave it to the reader). @ 

Example. Find whether the vectors p; = (—4, 1, —7), 
p2 = (2, 5, 9), and ps = (—8, 13, —3), defined by their coor- 
dinates in a Cartesian system, are coplanar. 

O We use the coordinate of the vectors to form a third-order 
determinant 


-4 1-7 
9-| 25 9 
-8 13 -3 


Direct evaluation shows that it is equal to zero. Hence it fol- 
lows, according to Theorem 2, that pi, po, and ps are 
coplanar. 6 
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Exercises to Chapter 3 


3.1 
1. Evaluate the determinants: 
7-4 ( 1 5 © cos a — sina 
» s c w 
ta) 5-3 ) ll 55 sin a — cosa 
2. Solve the equations: 
0 x+2-3 
(a) =1, = 
x0 x-2 x 


3. Using Cramer's rule, solve the systems of equations: 
10x —3y + 41-0 3x+ y+5=0 
(a) 


3x +2y- 8-20, x+S5y-3=0, 
xcosa—ysina=a 
xsina + y cosa = b 


4. Prove that the square polynomial ax? + bx + c, with a # 0, is a perfect 
square if and only if 


ab| _ 
be 
3.2 
1. Evaluate the determinants: 
213 -301 0a0 
(a) 15321], (D 1-524], (Qibcd 
143 037 0c0 
2. Solving the equations: 
123 Il o x2 
(9|-1x0| 2-1 (5 x -1x| 50 
324 =§ MES 
a aa atx x x 
œ) |-@ ax| =0, (d) x b+x x =0 
—a-ax x X c+x 


3.3 

1. Find the number of inversions in the permutations: 
(a) (7, 6, 9, 1, 2, 3, 5, 4, 8), (D (n,n — 1,n — 2,...,2, 1). Are the permutations 
in (a) and (b) everi? 
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2. Find whether the product (a) 5524453421032, (D) @51@23@34@45412 is a term 
of a fifth-order determinant. If the answer is affirmative, find the sign of the 
term. 

3. Choose the values of i and k so that the product 24786381/055407x424431 
is a term of a determinant (find its order) and has the plus sign. 

4. Prove that the determinant 


au 0 0 ... 0 
21 d22 Ü. uu 0 
03 d3 a33 ... 0 
Uni Am Ün3 ... Onn 


all whose elements located “higher” than a11, 422, ..., Ann are zeros, is equal 
to the product @11422@33... Ann. 
5. Prove that the fourth-order determinant 


Qu 812 d1 du 
anu az 0 0 
a3, a2 0 0 
Q4 an 0 0 


is zero. 
6. Find the terms containing x* and x° of the determinant 


3.6 


1. Using the properties of a determinant, show that 
(a) the equation 


1x x 
laa*|] =0 
1b Bb 
has the roots xı = a and x» = b, 
(b) each of the determinants 
123 sin? a 1 cos? a sin æ cos a sin (a + ô) 
4561], | sin?8 1cos?B], | sin cos sin (8 + à) 
789 sin? y 1cos?y sin y cos y sin (y + 4) 


is zero. 
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2. Using the properties of a determinant, evaluate 


1 3 6 
12 3 62 
122 315 623 


3. Prove the equation 
L 1] 1 
a b c| =(b- aXc — ate - b) 
Cbg 


both by a direct evaluation of the determinant and by using the properties 
of a determinant. 


3.7 
1. Find the minor of ay in the determinant A 
1 ab 
, i=],j=2, , i=2,j=2, 
(a) su * J (b) eG j 
5-72 1234 
(I9 13], i22,23, (d) |234 1], i24j-4 
4 16 3412 
4123 
2. Expand the determinant 
2-341 
(a) [4 -232 
a bcd 
3 -1 43 
by the third row, and the determinant 
5 a2 -1 
(b) 1404 -3 
2c3-2 
4d 5 —4 
by the second column. 
3. Evaluate the determinants: 
srra 1 2 3 4 0111 
(a) [1311], œ |-2 1-4 3|, œ@ |10 1 1|, 
1131 3-4-1 2 1101 
1 1 13 4 3-2-1 1110 
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01 1 3 2 0-134 -1 2 311 
()]|10 1! 6], (9 0 1 021|, (f 0-3 210 
11 0-2 -1 0 153 2 1-102 
36-2 0 -3 1-202 4 1-270 
-1-1 251 0.0 350 
3.8 
1. Using Cramer's rule, solve the systems of equations: 
x+ 2y t 3227 x+y+z=1 
(a) 42x- y+z=9 (D 42x- 3y +z=1 
x-4y+2z= 11, 4x+y -5z=1, 
2x+y+z+u=5 2x1 + 2x2. — x3 + 4 = 4 
(c) |\x+2y+z+u=4 (d | 4x1 + 3x2 — x3 + 2x4 = 6 
x+y+2z+u=7 8x, + 5x5 — 3x3 + 4x4 = 12 
x+y+z+2u=4, 3x1 + 3x2 — 2x3 + 2x, = 6 


2. Find all the solutions of the system of equations if the determinant of 
the system is nonzero: 


ax + by + cz = axı + bx; + bxs + bx, = CC 

(1) < ax + boy + &z = a2 (2) | bx, + ax. + bx; + ba = c 
a3x + b3y + Ciz = 43, bx, + bx + ax; + ba = 

bx, + bx. + bx; + ax =c 

3.9 

1. Find nontrivial solutions of the system of equations if it has any: 
2x - 4y 2 0 —-x+y=0 ~axt+ytz= 0 

(a) (c) ¢x-2y+z=0 
5x — 10y = 0, 2x + Ty = 0, x+y-2z=0 


2. Find the values of a at which the system has nontrivial solutions: 


x+ 2y- 3z=0 ax+y+z=0 
(a) 43x —-2y+z=0 (D ¢x+ay+z=0 
ax — My + 15z = 0, x+ytax=0 


3. Prove that if the system 


aix + biy = ĉi 
ax + by = c2 
a3X + bsy = C3 
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is consistent, then 


ay bi C1 
az b €2 -0 
ay bi €3 
3.10 
1. Find whether the vectors are coplanar: 
(a) a =¢-1, 4, 2), b= (3, 1, -5, c= (l, 9, -15, 
(b) a = (0, 1, 0), b= <a, B, y), e= (0, 2, 0) 


2. Find the values of ^ for which the vectors a = (3, 4, 2), b = (6, 8, 7), 
€ = (9, 12; X) are coplanar. 


Chapter 4 


THE EQUATION OF A LINE IN THE PLANE. 
A STRAIGHT LINE IN THE PLANE 


4.1. THE EQUATION OF A LINE 


Suppose that we have an equation 
F(x, y) 20 (i) 


in two unknowns x and y. If the numbers xo and yo are such 
that their substitution into the equation turns it into an equality 
then we say that the pair of numbers xo, yo satisfies the equa- 
tion. For example, the pair xo — 10, yo — 9 satisfies the equa- 
tion 1 — x + y = 0, while the pair xı = 2, yı = 3 does not. 

A pair of numbers x, y satisfying a given equation is not 
arbitrary: if x is defined, then y cannot be arbitrary since the 
value of y is related to x. It follows that a change in x causes 
a change in y. A point (x, y) will trace a line in the coordinate 
plane. 

1. A line defined by an equation. An equation corresponding 
to a line. 

Definition 1. Suppose we have an equation F(x, y) = 0. Then 
the set of all points (x, y) in a coordinate plane whose coor- 
dinates satisfy the equation is called the /ine defined by this 
equation. 

Definition 2. Suppose we have a line / in a coordinate plane. 
The equation corresponding to this line or simply the equation 
of the line / is an equation F(x, y) = 0 which defines the line /. 

In other words, if we have an equation and must find the 
line defined by this equation, we have to collect all the points 
(x, y) whose coordinates satisfy the equation. Conversely, if we 
have a line and must find its equation, we have to choose an 
equation in two unknowns such that it is satisfied by all the 
points of the line and only those points. 

If we have the equation of a line, then studying the geometric 
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M(x,y) 


Figure 48 Figure 49 


properties of the line reduces to studying its equation and this 
is the main purpose of analytic geometry. Its importance is that 
it is easier to study an equation than to investigate a line geo- 
metrically, especially since many methods have been developed 
in algebra and mathematical analysis for studying equations. 

The term running point of a line is often used when studying 
lines. This is a variable point M(x, y) which moves along the 
line (Fig. 48). The coordinates x and y of a running point are 
called running coordinates. 

An important remark must be made. An equation 
F(x, y) = 0 relating x and y can sometimes be solved for y, 
ie. y can be explicitly expressed in terms of x. In this case the 
above equation can be replaced by an equation of the form 
y = f(x). A curve defined by such an equation is called the 
graph of the function f(x) For instance, the equation 
1—x+y=0 implies the relationship »y2x-—1 (or 
x-2ydl. 

Finally we note that the equation of a line can be considered 
in any coordinates in the plane and not only in rectangular 
Cartesian coordinates. For instance, in polar coordinates the 
equation of a line is of the form f(r, ¢) = 0. 

Example 1. We have an equation x - y= Oor y — x. If x 
assumes an arbitrary value, then y assumes the value of x. Con- 
sequently, a line defined by this equation consists of a point 
equidistant from the x- and y-axes and located in the first or 
the third quadrant. In other words, the equation defines the 
bisector of the first and the third quadrant angles (Fig. 49). 
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Figure 50 Figure 51 


Example 2. We have an equation x + y = O or y = —x. This 
equation defines the bisector of the second and the fourth 
quadrant angles (Fig. 50). 

Example 3. We have an equation y = 0. Although the equa- 
tion does not involve the x coordinate, we may consider it to 
be an equation in two unknowns, x and y (in order to show 
that this assumption is valid, we can write this equation in the 
form 0-x + y = 0; now the left-hand side formally contains x). 
The line / defined by this equation consists of the points with 
zero ordinate; the abscissa can be arbitrary since the equation 
imposes no conditions on x. Consequently / is the, x-axis. 

Example 4. We have an equation x? + y? = 0. It is satisfied 
by a single pair of values of x and y, namely x = 0 and y = 0. 
Therefore, the equation defines just one point and this is the 
origin. We may consider this point as a degenerate line. 

Example 5. We have an equation sin? (x + y) — 3 = 0. Obvi- 
ously, this equation cannot be satisfied by any values of x and 
y. Consequently, this equation defines an “empty” line (a line 
containing no points). 

Example 6. We have an equation in polar coordinates 
r — acos 9 = 0, where a > 0 is a constant number. This equa- 
tion defines a circle of radius R = a/2 which passes through 
the pole O, the center of the circle lying on a polar ray (Fig. 51). 

Let us consider a point A on a polar ray at the distance a 
from the pole. The condition r = cos y means that 2 OMA is 
a right angle. Every point on the indicated circle possesses such 
a property and only they do. 
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M(x, y) 


Figure 52 


The examples considered above specified an equation and we 
had to find the line defined by the given equation. We now 
consider the inverse problem: to form the equation of a given 
line. Bear in mind that the desired equation can be written in 
different forms. ‘ For instance, x-—y=0, y-x=0, 
(x — y)? = 0, Ix — yl = 0 are different forms of the equation 
of the same line, namely the bisector of the first and third quad- 
rant angles. 

Example 7. Set up the equation of the circle with center at 
P(a, b) and radius R (Fig. 52). 

© We mark an arbitrary running point M(x, y) on the circle. 
Then the distance from M to P equals R: 


V(Qx- a) *( -b) =R (2) 


and this is the equation of the circle. Indeed, 

(1) if the point M(x, y) lies on the circle, then its x, y coor- 
dinates satisfy equation (2) since IMP | = R, 

(2) if a point N(x, y) is not on the circle, then its coordinates 
do not satisfy equation (2). Indeed, in this case the distance 
between N and P is not equal to R. 

A simpler form of equation (2) can be obtained by squaring 
its both sides: . 


(x - a)? + Q — b}? = R? 


which is an equation of a circle in rectangular Cartesian coor- 
dinates. 
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If the center P of the circle coincides with the origin, then 
a = 0, b = 0 and the equation assumes the form 


x+y? =R @ 


2. A problem on the intersection of two lines. 

Suppose that lines / and h are specified by their equations 
F(x, y) = 0 and (x, y) = O respectively. Let us find the points 
at which the lines intersect. From the viewpoint of analytic ge- 
ometry, finding intersection points means finding the coor- 
dinates in a given coordinate system. 

If M(xo, yo) is an intersection point, then F(xo, yo) = 0 since 
M belongs to /; and (xo, yo) = 0 as well, since M also belongs 
to b. Thus the numbers X and yo satisfy the system of 
equations 

F(x, y) =0 
{ (x, y) = 0 


Conversely, if a pair of numbers xo, yo satisfies the system, then 
the point M(xo, yo) belongs to the two lines simultaneously, 
that is, it is a point at which they intersect. 
Thus, in order to find the points of intersection of two lines, 
we must solve the system of the equations of these lines. 
If the system has no solution, then the lines do not intersect; 
if the system has a unique solution (a single pair of numbers 
Xo, Jo), then the lines intersect only at one point; if the system 
has several solutions, then the lines intersect at several points. 
Example. Find the point of intersection of the bisectors of 
the first and third quadrant angles with the circle of radius 
R = V5 centered on the y-axis a unit from the origin (Fig. 53). 
O We know that the equation of this bisector has the form 
x — y = 0. The equation of the circle is x? + (y — 1)? = 5. The 


system 
x-y=0 
Xr+(y- 1)? =5 
has two solutions: x1 = 2, y; = 2 and x; = —1, y; = —1. This 


means that the lines intersect at two points, (2, 2) and (— 1, 
-1). @ 
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Figure 53 


4.2. PARAMETRIC EQUATIONS OF A LINE 


Suppose that a point M moves along a line / in the plane. 
If a coordinate system is specified, the position of M is defined 
by its coordinates x and y. As the point moves, its coordinates 
vary, that is they depend on time: 


x= f(t), y = et) (1) 


The symbols f(t) and (t) mean there is a functional depen- 
dence. The variable ¢ (time) is a parameter; given a certain value 
of t we can find the values of x and y from formulas (1), i.e. 
the position of M. Therefore we usually say that formulas (1) 
specify the line / parametrically. 

Let us digress from physics considerations and suppose that 
we have a varying quantity ¢ (not necessarily time). We specify 
two functions 


x= f(t), y = et) Q) 


If we assign arbitrary values to the parameter ¢ (such that f(t) 
and e(t) are meaningful), then x and y will vary and the point 
M(x, y) will change its position in the coordinate plane. 
Definition. The set of all points M(x, y) whose coordinates 
are defined by formulas (2) is called the /ime represented 
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Figure 54 Figure 55 


parametrically, and formulas (2) are called the parametric 
equations or parametric representation of this line. 
Example 1. Find the line defined by the parametric equations 


x-Rcost, y=Rsint (3) 


© Squaring both sides of the equations and adding 
the results yields x? + y? = R?cos?t + R?sin?t = R*. But 

+ y? = R? defines a circle of radius R with center at the 
origin. Thus, any point whose coordinates are defined by for- 
mulas (3) lies on this circle. 

If t varies from 0 to 2r, then the point x = Rcost, 
y = Rsint traverses the entire circle. Therefore formulas (3) 
with the additional condition 0 < t < 2 consists of the paramet- 
ric equations of a circle. 

The meaning of the parameter ¢ can be understood from 
Fig. 54: t is the polar angle of the point M belonging to the 
circle. If we neglect the additional condition (0 < t < 27), then 
the point M will traverse the circle many times as t varies from 
—o to +o, 6 

Example 2. A triangle ABC with sides a, b, and c and oppo- 
site angles o, 8, and y slides with its vertices A and B along 
the coordinate axes. Set up the parametric equations of the 
curve described by the vertex C when it moves. Assume the an- 
gle ¢ is the parameter (see Fig. 55). 
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© Let the current position of C be defined by the coor- 
dinates x and y. We can write OC = OA + AC or in terms of 
coordinates 


(x, y) = €c:cos (x — (t + o), 0) + (boost, bsint) 
Whence 
x = c:cos(m — (t + a)) + bcost, y = bsint 
Using reduction formulas we simplify the above expressions: 


x= —c-cos(t + a) + bcost, y= bsint. @ 


4.3. A STRAIGHT LINE IN THE PLANE AND ITS EQUATION 


The position of a straight line in the plane can be defined 
in different ways. For example, we can specify 

(a) two points Mo and Mi belonging to the straight line, 

(b) one of the points Mo of the straight line and a vector 
a to which the line must be parallel, 

(c) one of the points Mo of the line and a vector n to which 
the line must be perpendicular, 

(d) one of the points Mo of the line and an angle o which 
the line must form with, say, the x-axis. 

We assume in this section that a rectangular Cartesian coor- 
dinate system is introduced in the plane. 

1. The equation of a straight line which passes through a 
given point perpendicular to a given vector. 

Given a point Mo(xo, yo) and a nonzero vector n = (A, B), 


Figure 56 
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Figure 57 


set up the equation of a straight line / passing through Mo and 
perpendicular to n (Fig. 56). m 

Obviously, M(x, y) belongs to / if and only if the vector MoM 
is perpendicular to n. The condition for two vectors to be per- 
pendicular is: the sum of the products of their corresponding 
coordinates is zero. Noting that the coordinates of the vector 
MoM are X — Xo, Y — yo, we write the condition that MoM L n 
in the form 


A(x — x) + BO — yo) = 0 (t) 


Thus, the point M(x, y) belongs to the straight line / if and 
only if its coordinates satisfy condition (1). Consequently, (1) 
is the equation of the straight line /. 

We arrive at a conclusion: the equation of the straight line 
passing through a point Mo(xo, yo) and perpendicular to the 
vector n — (A, B) has the form in (1). 

Note that a nonzero vector which is perpendicular to a given 
straight line is called a normal vector to the line. There are an 
infinite number of normal vectors to a straight line, all the vec- 
tors being mutually collinear. 

Example 1. The equation of a straight line passing through 
the point Mo(—1, 2) and perpendicular to the vector n = 
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(5, —4) has the form 
S(x- (-1) - 40 - 2) =0 


or, on simplifying, 5x — 4y + 13 = 0. 

Example 2. Three points A(—1, 2), B(3, 5), and C(4, —2) 
are given in the plane. Ascertain that the figure ABC is a trian- 
gle and set up the equation of its altitude drawn from the vertex 
A (Fig. 57). 

© The points A, B, and C are not collinear since the condi- 
tion of collinearity is not fulfilled: 

3-(-1 5-2 


| Em] mE 


Consequently, ABC is a triangle. 

Before forming the equation of the altitude drawn from the 
point A, we note that we can take the vector BC = <I, —-7) 
as a normal vector to the altitude. Hence it follows that the 
equation of the altitude has the form 1-(x — (-1) - 
7(y — 2) = Oor simpler x — 7y + 15 = 0. (There is no need for 
a drawing since we have solved the problem analytically.) @ 

2. The general equation of a straight line. 

After removing the brackets equation (1) assumes the form 


Ax + By + (—Axo — Byo) = 0 
We use C to denote the number (— Axo — Byo) and get 
Ax + By c C20 (2) 


Thus, every straight line is associated with a linear equation 
in two unknowns x and y, at least one of the coefficients A 
or B being nonzero (note that A and B are the coordinates of 
the normal vector to the straight line). 

Let us prove the converse: If we have an equation like (2) 
in which either A or B is nonzero, then this equation defines 
a straight line. 

For example, let B Æ 0. Then (2) can be written as 


AG - 0) + B(y- (- $)) =0 
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According to Item 1, this equation defines a straight line pass- 
ing through the point Mo (0, —C/B) and perpendicular to the 
vector n = (A, B}. Now, if A # 0, the reasoning is similar. 

Thus, if a rectangular Cartesian coordinate system is fixed 
in the plane, we have 

(1) every straight line is defined by a linear equation 
Ax + By + C= 0 in which A or B is nonzero, 

(2) an equation of the form Ax + By + C = 0 in which A 
or B is nonzero defines a straight line. 

The equation Ax + By + C = 0, provided that A or B is 
nonzero, is called the equation of a straight line in the plane 
written in a general form or simply the general equation of a 
straight line. 

Note an important fact we have established above when con- 
sidering (2): the coefficients A and B in the equation 
Ax + By + C=0 of a straight line are the coordinates of a 
normal vector to that line. 

3. The equation of a straight line which is parallel to a given 
straight line and the equation of a straight line which is perpen- 
dicular to a given straight line. 

Let us consider the general equation of a straight line 


Ax + By+C=0 (3) 


We have noted that the vector (4, B) is a normal vector to 
the straight line. 

The following propositions are valid: 

(1) If a straight line | is defined by equation (3), then any 
straight line parallel to | can be defined by an equation of the 
form 


Ax * By 4 C' 0 (4) 


(2) If a straight line | is defined by equation (3), then any 
straight line which is perpendicular to | can be defined by an 
equation of the form 


Bx - Ay+C” =0 (5) 
Let us prove proposition 1. At any value of C’ the straight 
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line (4) is parallel to the straight line (3) since it has the same 
normal vector. On the other hand, whatever the point Mo(xo, 
Jo), we can “make” the straight line (4) pass through this point 
by selecting the value of C’ (we must choose C' = —Axo — 
Byo); consequently, by varying C' we can get any straight line 
parallel to /. 

Let us now prove proposition 2. Normal vectors to the 
straight lines (3) and (5) are perpendicular since their scalar 
product is zero: 


A-B + B-(-A) = 0 


Consequently, for any value of C” the straight line (5) is per- 
pendicular to the straight line (3). On the other hand, whatever 
the point Mo(Xo, yo), a suitable selection of C" can make the 
straight line (5) pass through this point; this means that by 
changing C" we can obtain any straight line perpendicular to /. 

Example. A straight line / is defined by the equation 
3x — 7y + 12 = 0. Form the equation of a straight line passing 
through the point Mo(5, 1) parallel to / and the equation of 
the straight line passing through Mo and perpendicular to /. 

© The respective equations of the two lines are 3x — 7y + 
C" = Oand —7x — 3y + C" = 0. We choose C’ and C" such 
that the lines both pass through Mo: 


3-5 -7-1+C’=0, -75-31+C”=0 
Whence C’ = —8, C" = 38. The desired lines are 
3x-7y-820, -7x-3y+38=0 @ 


4. Studying the general equation of a straight line. 
Let a straight line in the plane be defined by the equation 


Ax + By+C=0 


where A # OorB = 0. Let us examine the location of a straight 
line relative to the coordinate axes depending on whether either 
of the numbers A, B, C is zero (or nonzero). 

( If 4-0, then the equation assumes the form 
By + C = Qor y = —C/B, which means that all the points of 
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Figure 58 


the line have the same ordinate, —C/B. Consequently, the 
straight line is parallel to the x-axis (Fig. 58). 

(2 If B=0, then the equation is Ax + C=0 or 
x = —C/A, which means that all the points of the line have 
the same abscissa, —C/A. Thus, the straight line is parallel to 
the y-axis. 

(3) If C = 0, then the equation is Ax + By = 0. One of the 
solutions of the equation is the pair of numbers x = 0, y = 0, 
which means that the straight line passes through the origin. 

(4) If A = 0 and C = O, then the equation is By = 0 or, since 
B = 0, y = 0. Thus, the straight line defined by this equation 
coincides with the x-axis. 

(5) If B = Oand C = 0, then the equation is Ax = 0 or, since 
A #0, x = 0. This equation defines the y-axis. 

It is easiest to construct a straight line from its equation by 
finding its two points. When B z 0, we write the equation in 
the form 

A C 
das iom 
take two different values of x, and find the corresponding 
values of y. Now, if B = 0, the straight line is parallel to the 
y-axis; in this case, in order to construct this line, we take any 
two of its points with abscissa x = —C/B. 

Example 1. Construct the straight line defined by the equa- 
tion 3x — 2y + 8 = 0. 

O We rewrite the equation as y = (3/2)x + 4. Then take, for 
instance, xy = 0 and x; = —2 and find that y = 4 and y: = 1. 


| 
| 
| 
| 
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(0,4) 


Figure 59 Figure 60 


We draw a straight line through the points (0, 4) and (—2, 1) 
(Fig. 59). @ 

Example 2. Construct the straight line defined by the equa- 
tion 3x + 2y = 0. 

O Since the constant term is absent, the line passes through 
the origin. Consequently, we know one point. Let us find 
another. Setting x = 2, we have y = —3. We draw a straight 
line through the origin and the point (2, —3) (Fig. 60). @ 

5. The equation of a straight line passing through a given 
point parallel to a given vector. 

Given a point Mo(xo, yo) and a nonzero vector a = (p, q}, 
find the equation of the straight line passing through Mo 
parallel to a (Fig. 61). 

We restrict our consideration to the case where p # 0, q ¥ 0: 
if p = 0 or q = 0, then the straight line / is parallel to the y- 
or x-axis, and its equation has the form x = xo or y = yo 
respectively. 


Figure 61 


8— 366 
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Obviously, a point M(x, y) belongs to / if and only if the 
vector MoM is collinear with a, which, in its turn, is the case 
if and only if the coordinates of MoM are proportional to the 
corresponding coordinates of a, i.e. 


X — Xo y — Jo 
p q 


Thus, (6) is the equation of the straight line passing through 
Mo(xo, yo) parallel to a = (p, q}, where p#0, q #0. 

A nonzero vector parallel to a straight line is called the direc- 
tion vector of that line. Every straight line has an infinite num- 
ber of direction vectors, these vectors being mutually collinear. 

Proceeding from the foregoing, we can easily solve another 
problem: to set up the equation of a straight line passing 
through two given points Mo(xo, yo) and Mi(xi, yi). We take 
the vector MoM, as a direction vector, whose coordinates are 
P=X — Xo and q = yi — yo respectively. Assuming that 
Xj Æ Xo and y; ¥ yo, we find from (6) that the equation of / 
has the form 


(6) 


X — Xo eed 
X1-—X Y¥i- yo 


(7) 


Thus, (7) is the equation of the straight line passing through 
the two points Mo(%, yo) and Mi(xi, yi), where x, x xo, 
Yı Æ yo, or the two-point equation. 

When x; = xo, the straight line is parallel to the y-axis and 
its equation is x = xo. When yı = yo, the straight line is parallel 
to the x-axis and its equation is y = yo. 

Example 1. Given three points A(—1, 7), B(0, 9), C(1, 3), 
set up the equation of the straight line passing through A 
parallel to the straight line BC. 

O The coordinates of BC are 1 — 0 and 3 — 9 respectively, 
ie. 1 and —6. We substitute xo = —1, yo = 7 (the coordinates 
of A) and the assumed values p = 1, g = —6 into equation 
(6) and obtain 


xt boye 
1 —6 


or 6x+y—-1=0®@ 
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Example 2. Given a triangle with vertices A(0, 7), B(2, 5), 
C(6, —4), set up the equation of the median from C. 
© Let P be the midpoint of the side AB whose coordinates 
are 
042 7+5 


X = =i Yo = 2 =6 


Since the median passes through C and P, we can write its equa- 
tion as that of the straight line passing through two points 
Mo(1, 6) (the point P) and Mi(6, —4) (the point C). We have 


xl y-B 
6-1 -4-6 


or 2x 3 y - 8-00 


6. An intercept form of the equation of a straight line. 
Suppose that two points differing from the origin are speci- 
fied on the coordinate axes: A(a, 0) on the x-axis, with a # 0, 
and B(0, b) on the y-axis, with b x 0. Let us consider the 
equation 
x 


y= 
a P 1 (8) 


This equation is satisfied by the coordinates of both A and 
B. Consequently, (8) defines the straight line AB (Fig. 62). 

Equation (8) is called the intercept equation of a straight line 
with the numbers a and b being the intercepts. Any equation 
of the form 


Ax * By- C20 


Figure 62 Figure 63 
8* 
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Figure 64 


can be reduced to the form of (8) if the three numbers A, B, 
and C are nonzero. We first write the equation as 
Ax + By = —C, then divide both sides by —C to get 
ee diee 
scm ug! 
This is the intercept form of the equation of a straight line, 
where a = -C/A and b= -C/B. 
For example, the equation 5x — 2y + 10 = 0 can be trans- 
formed as above to get 


AC. Sere A 
Bola 


This equation defines the straight line passing through the 
points (—2, 0) and (0, 5) (Fig. 63). 

Note that intercept equations are convenient for constructing 
straight lines. 

Example. We draw a straight line through the point P(3, 2) 
so that P is the midpoint of the line segment AB (Fig. 64). 
O We write the equation of the desired straight line as 
X4] 


a b 


Since the point P is the midpoint of AB, where A(a, 0) and 
B(0, b), we have 


Ch. 4. A Straight Line in the Plane 117 


that is, a = 6 and b = 4. Thus, the equation of our straight 
line is 


7. Slope of a straight line. 

Consider a straight line / in the plane. The angle of inclina- 
tion of l to the x-axis is e (Fig. 65), through which we must 
turn the x-axis for it to coincide with /; such angles are meas- 
ured counterclock wise. 

This definition is not unique. In fact, if we turn the x-axis 
through an angle of o + mn, where n is an integer, it will also 
coincide with /. The angle of inclination e is the least nonnega- 
tive angle of rotation, ie. such that 0 € o < s. 

The quantity k = tan y is called the slope of the straight line 
l. 

` If e = «/2, then tan e = 0. Hence, a straight line which is 
parallel to the y-axis has no slope. All other straight lines hav 
slopes. : 

If o = 0, then tan e = 0. Hence the straight line is parallel 
to the x-axis and its slope is zero. Conversely: if k = 0, then 
tan e = 0 and hence ¢ = 0, and this means that the straight 
line is parallel to the x-axis. 

For finding the slope of a straight line, we must know its 
two points. If Mo(xo, yo) and Mi(xi, yı) are two points of a 


Mx , y) 


Figure 65 Figure 66 
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straight line /, then its slope is 


End — yo 
k X1 — Xo o 


The proof follows directly from Fig. 66. If two straight lines 
are parallel, then their respective slopes are equal. 

Example 1. The slope of the straight line passing through 
points A(a, 0) and B(0, b) is 


b-0 b 


ME a 


We assume here that a is nonzero. 
Example 2. The slope of the straight line passing through 
the origin O(0, 0) and a point A(a, b) is 


We assume here that a is nonzero. 

8. The equation of a straight line passing through a given 
point and having a given slope. 

Suppose we have a point Mo(xo, yo) in the plane and a slope 
k. Let us derive the equation of the straight line which passes 
through Mo and have the slope k. 

If a point M(x, y) belongs to /, then from (9) we have 


ZU 
X — Xo 


Now, if M does not lie on /, then the above equation is not 
satisfied. Thus this equation is the equation of the straight line 
I. We rewrite it as 


y — yo = k(x — xo) (10) 


Thus, (10) is the point-slope equation of the straight line | that 
passes through Mo(xo, yo) and has slope k. 

If a point Mo lies on the y-axis, ie. Mo(0, b), then (10) as- 
sumes the form y — b = k(x — 0) or 


y=kx+b (11) 
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This is the form of the equation of the straight line that has 
slope k and whose y-intercept equals b. 

Note that the equation of any straight line having a slope, 
ie. which is not parallel to the y-axis, can be written in the 
form (10) or (11). 

Equation y = kx + b is the slope-intercept equation of the 
straight line with slope k and y-intercept b. If the straight line 
defined by a general equation 


Ax + By +C=0 (12) 


is not parallel to the y-axis (i.e. B # 0), then in order to reduce 
its equation to the form in (11) it is sufficient to write (12) as 
By = —Ax — C and then divide its both sides by B. We get 


ccu C 
S R 
Whence it follows that the slope of the line (12) is 
c A 
k= B (13) 


Example 1. Find the slope of the straight line 
3x + Sy-—7=0. 

© We reduce this equation to the form y = kx + b. We have 
Sy = -3x +7 or y = —(3/5)x + 7/5. Thus, the slope is 
k= -3/5, @ 

Example 2. Derive the equation of the straight line that pass- 
es ‘through the point Mo(2, 1) and has the slope k = -1. 

© Substituting the data into equation (10) yields y — 1 = 
-]-(x-2)orx-y-320.0 

Example 3. A light ray is directed along the straight line 
y = 3x — 5. When it reaches the x-axis it is reflected. Find the 
equation of the reflected ray. 

O We find the point at which the incident ray and the x-axis 
intersect. Since the ordinate of the point of reflection is zero, 
we put y = 0 in the equation of the ray. We have 3x — 5 = 0 
or x = 5/3. We know a point Mo(5/3, 0) on the reflected ray; 
it remains to find its slope. If the angle of inclination of the 
incident ray to the x-axis is y, then the angle of inclination of 
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Figure 67 Figure 68 


the reflected ray is 9 (Fig. 67) We know that 
tan(—g) = —tang. We can see from the equation k = 3 for 
the incident ray. Hence, for the reflected ray k = —3. Substitut- 
ing the coordinates of Mo and the value of k into (10), we have 
y-0-2-3(x- 5/3) or 3x + y — 5 = 0, which is the equa- 
tion of the reflected ray. 

9. The angle between straight lines. 

Let us consider two straight lines /, and h in the plane 
(Fig. 68). One of them we call the first and the other the second. 
Let kı and k be the slopes of these lines. j 

The angle y through which we must turn the first straight 
line for it to coincide with the second is the s/ope of the second 
straight line to the first. The sign of the angle y depends on 
whether such a rotation is counterclockwise or clockwise. Obvi- 
ously, the angle o is determined to within the term multiple 
of =. 

If yı and e» are the slopes of the first and second straight 
lines to the x-axis respectively, then o = «2 — «1. Whence 


tan e» — tan qi 


dui MENU TI 


But tan yı = kı and tan g = k2, thus, 


ky — ky 


Man dai rer 7a 


(14) 
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We have obtained a formula for finding the angle between 
two straight lines, provided we know the slopes of the lines. 

Formula (14) is senseless if the denominator on its right-hand 
side is zero. In this case tan y does not exist, and this means 
that y = 2/2. Hence we have the condition for two straight 
lines to be perpendicular. kik, + 1 = 0. 

This condition is usually written as 

k=- -L iy x) (15) 
ky 

Example 1. Find the angle o between the straight lines 
3x — 5y + 7 = 0 and 2x —- 3x+4=0. 

O We find the slopes of the lines: kı = 3/5 and kz = 2/3. 
Whence 


From tables of trigonometric functions we find that 
y = 2°48’. @ 

Example 2. A ray of light is directed along the straight line 
y = 2x — 4. On reaching the straight line y = x, the ray is 
reflected. Find the equation of the reflected ray. 

© Find the point at which the incident ray and the line y = x 
intersect. Solving the equations y = 2x — 4 and y = x simul- 
taneously yields x = 4, y = 4; the intersection point is Mo(4, 4). 

If we denote the angle of inclination of the incident ray to 
the line y = x by y, then we have 


The angle of inclination of the reflected ray to the line y = x 
is obviously — y, and its tangent —tan y = —1/3. Then denot- 
ing the slope of the reflected ray by k and using (14), we can 
write 
l1. k-lI 
3 1+kl 
whence it follows that k — 1/2. 
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Thus, the reflected ray passes through the point Mo(4, 4) and 
its slope is k = 1/2. The equation of this ray is 


y-4=5-@-4) 
r 
j x-2y+4=00 


4.4. RELATIVE POSITION OF TWO STRAIGHT LINES 
IN THE PLANE 


Suppose that we have two straight lines /, and h in the plane 
which are defined by the equations 


Aix + Biy+C, =0 (1) 
for /, and 
Aix + By +C, 20 (2) 


for h. 

Let us show how we can find the relative position of the 
straight lines /; and /; from their equations. Three cases are 
possible. 

Case 1. The lines coincide. Whence it follows that their nor- 
mal vectors, nı = (Ai, B1) and m = (45, B»), are collinear. 
Since n; # 0, there is a number such that n; = Am, that is, 
A2 = XA; and B; = AB;. We write the equations of the straight 
lines as 


Aix + By+ C120 (for A) 
AAix + ABiy t C520 (for h) 


Any point of /; (or of h) satisfies each of the above equations 
and hence the equation 


0-x + O-y + (Co -AC1) 20 (3) 


which results from the subtraction of both sides of the first 
equation multiplied by À from the respective sides of the second 
equation. An equation of form (3) has a solution if its constant 
term is zero, ie. C) = AC. 
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Thus, the condition for the straight lines ly and h to coincide 
is the existence of a number X such that Az = MA1, B; = dBi, 
C2 = AC}. 

This condition means that one of equations (1), (2) is ob- 
tained from the other by multiplying both sides by the same 
number A. 

Case 2. The straight lines do not coincide but are parallel. 
As in case (1) it follows that their normal vectors are collinear, 
Le 42 = A41, B; = AB,. Since /; and h do not coincide, we 
must have C; x AC). 

Thus, the condition for the straight lines l| and b to be 
parallel and not coincident is the existence of a number X such 
that A3 =A, B2 = Bi, C43 £ AC). 

Example. The straight lines 6x — 2y + 4 = Oand 9x — 3y + 
6 = 0 coincide since 6: 9 = (—2):(-3) = 4:6. These lines 
are parallel since 6:9 = (—2):(—3) # 4:5. 

Case 3. The straight lines /; and h% are not parallel, they inter- 
sect at a single point. In this case their normal vectors, n; and 
nz, are not collinear. Conversely, if n; and n; are not collinear, 
then /, and h are not parallel. 

Thus, the condition for the straight lines l| and b to be not 
parallel is that their normal vectors are not collinear. 
Noting that vectors (.41B1) and (42B2} are not collinear 
if and only if the determinant formed from their coordinates 
is nonzero (see Sec. 3.10), we can infer that the condition for 
the straight lines ly and b to be not parallel is 


A; Bi 


A B #0 (4) 


If / and / are not parallel, we can find their point of inter- 
section by solving the system of equations (1) and (2). Accord- 
ing to (4), the solution can also be found using Cramer's rule. 

Example. Verify that the straight lines 2x — 3y + 5 = 0 and 
X — 5y + 6 = 0 are not parallel and find the point at which 
they intersect. 
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© The lines are not parallel since 
2 -3 


i =| -720 
We find the point of their intersection by solving the system 
2x- 3y = -5 
x- 5y = -6 
According to Cramer’s rule, 
-5 -3 2 -5 
2 l=8 5 i | -6| _ 
Se see d room 


Thus, the lines intersect at the point (—1, 1). @ 


4.5. PARAMETRIC EQUATIONS OF A STRAIGHT LINE 


Suppose a straight line passes through a point Mo and 
parallel to a vector a (Fig. 69). Then the condition for a point 
M to belong to this line is the collinearity of the vectors MoM 
and a. In other words, the point M lies on this line if and only 
if the following equation is satisfied for a number t: 


MoM = ta (1) 
We introduce a rectangular Cartesian coordinate system in 


the plane, specify a point Mo(xo, yo) and a vector a = (p, q) 
in this system, and denote the coordinates of the running point 


Figure 69 
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M by x and y. Then the vector equation (1) takes the form 
X — Xo = ID, y — yo = tq or 


X — Xo - fp, y — yo * tq Q) 


If the parameter ¢ varies from — œ to + œ then the point M, 
whose coordinates x and y are defined by formulas (2), runs 
along the entire straight line. 

Thus, formulas (2) are parametric equations of the straight 
line passing through the point Mo(xo, yo) and parallel to the 
vector a= (p, q}. 

Example. The parametric equations of the straight line pass- 
ing through the point Mo(2, —1) and parallel to the vector 
a = (7, 3) have the form x 22 + 7f, y = —1 3t. 


4.6. DISTANCE BETWEEN A POINT 
AND A STRAIGHT LINE 


Given in the plane a point Mo(xo, yo) and a straight line / 
defined by the equation Ax + By + C = 0, find the distance 
d from Mo to l. 

If MiGa, yı) is a point of /, then the desired distance (Fig. 
70) is 

d = Iproj, MoM, | 


where proj, is the projection of a vector MoM, on n and n is 


n=< A, B» 


Mo(Xo, Yo) 


Figure 70 
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a normal vector to /; or else 


IMoM;-nI 


g= Ini 


We can take the vector n = < A, i B) as a normal vector to the 
straight line /. Noting that MoM, = (x1 — xo, Yı — Yo) , we 
obtain 


= lAQa — Xo) + B(yi — yo)l 


i JETE Š 


Since M; belongs to /, we have Axı + By; + C = 0. Thus 
we can write formula (1) as 


_ lAxo + Byo + Cl 


d 
VA? + B? 


(2) 


We have arrived at the following rule: in order to find the 
distance from a point Mo to a straight line | defined by the 
general equation we have to substitute the coordinates of Mo 
Jor the running coordinates x, y in this equation afd divide 
the absolute value of the result by NV A? + B?. 

Example 1. Given the equation 3x — 4y + 10=0 of a 
straight line, find the distance between the point M(4, 3) and 
this line. 

O From formula (2) we have 


g- 13:4-43 +101 _ 10 
ar +g 5 


Example 2. Find the distance between the straight lines 
3x — 4y + 10 = 0 and 3x — 4y- 5= 0. 

O We choose a point (3, 1) on the second line and find its 
distance to the first line: 


da- [23-414 101 15 3 @ 


=2 @ 
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4.7, HALF-PLANES DEFINED BY A STRAIGHT LINE 


Suppose we have a straight line in the plane in rectangular 
Cartesian coordinates: à 


Ax + By+C=0 (1) 


This line divides the plane into two half-planes; the line is their 
boundary or edge (Fig. 71). 

Let us prove the following proposition: one of the half-planes 
defined by (1) is composed of points (x, y) for which 


Ax + By - C20 (2) 
and the other is composed of points (x, y) for which 
Ax + By - C«0 (3) 


o First we assume that B ¥ 0. Then (1) is equivalent to the 
equation y = kx + b. Inequalities (2) and (3) reduce to the 
forms y 2 kx + b and y < kx + b respectively. The first in- 
equality defines the half-plane lying above the straight line 
y = kx + b (Fig. 72) and the second the half-plane lying below 
the line. Now, if B — 0, then A is necessarily nonzero and (1) 
is equivalent to the equation x = c; (2) and (3) reduce to the 
forms x > c and x « c respectively. The first inequality defines 
the half-plane lying to the right of the straight line x = c and 
the second the half-plane lying to the left of the line. m 


Figure 71 
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(x 37 Y1 > gg- kx +b 


(x Yo) 


(x, uo) 


Y2<Yo= kx+d 


Figure 72 


Example 1. Check whether the points A(—1, —3) and B(2, 
2) lie on one side or on different sides of the straight line 
2x-y—-—3=0. 

O We substitute the coordinates of A and B into the left- 
hand side of the equation. We have2-(—1) — (—3) — 3 = -2 
and 2-2 — 2 — 3 = —- 1. The resulting numbers have the same 
sign. Hence the points A and B lie on one side of the line. @ 


Exercises to Chapter 4 
4.1 


1. Find the lines defined by the following equations: (a) x = 0, (b) xy = 0, 
©) (x — 2Xx + 3) = 0, ( Ixl - Iyl 20, (0o X2- y? = 0. 

2. A straight line is displaced so that the area S of a triangle which the line 
forms with the coordinate axes remains constant. Find the equation of the line 
described by the midpoint of the segment of the straight line intercepted by 
the coordinate axes. 

3. Describe the line defined in polar coordinates by the equation: (a) e = 0, 
(b) r = I, (c) r = e (the spiral of Archimedes). 

4. In order to balance a body of mass m on an inclined plane forming an 
angle o with the horizontal plane, we must apply a force Q = mg sin a (Fig. 
73). Graph the function Q(o) in polar coordinates assuming that « is the polar 
angle and Q is the polar radius. 


Figure ?3 
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5. Find the points of intersection of two circles. One circle is centered at 
a point (a, 0) on the x-axis with radius a, and the other at a point (0, 5) on 
the y-axis with radius b. 


4.2 


1. A body of mass m is thrown with an initial velocity vo at an angle o 
to the horizon. Find the parametric equations of its trajectory with time £ as 
a parameter. 

2. A thread is wound onto a fixed disk of radius 1. Then the thread is wound 
off by pulling its free end so that there is no sag. Compile the parametric equa- 
tions of the trajectory of its free end. Take the center of the disk as the origin 
and direct the x-axis toward the free end of the thread (before unwinding). 
Assume that the length of the unwounded thread is the parameter. 


43 


1. Compile the equation of the indicated straight line: 

(a) through the point (3, — 1) and parallel to the bisector of the first quadrant 
angle, 

(b) through the point (3, — 1) and perpendicular to the bisector of the first 
quadrant angle. 

2. Drop the perpendiculars to the straight line 7x + 3y — 21 = 0 from the 
points of intersection with the coordinate axes. Find the equations of the per- 
pendiculars. 

3. Find a point B symmetric to A(— 1, 10) with respect to a straight line 
x-2y+6=0. 

4. Given the vertices A(2, —1), B( —1, 3), C(4, 0) of the triangle, write the 
equations of its sides. 

5. Find the equations of the straight lines drawn through the vertices of 
4 ABC (see the previous exercise) and parallel to the opposite sides. 

6. Find the points at which the straight line drawn through the points 
(-7, 1) and (5, 6) intersects the coordinate axes. 

7. Given the equations of the sides of A ABC: 3x + 2y — 8 = 0 (for AB), 
4x — y — 7 = 0 (for AC), and 10x — 3y + 41 = 0 (for BC), find the coor- 
dinates of its vertices. 

8. Compile the equations of the altitudes of A ABC (see the previous ex- 
ercise). 

9. Find the equation in intercept form of the straight lines: y = —2x + 3, 
5x + 3y — 15 = 0, and 3x - 2y + 12 = 0. 

10. Find the area of a triangle formed by the straight line 
—3x + 2y + 6 = 0 and the coordinate axes. 

11. Write the equation of a side of a square whose diagonals serve as coor- 
dinate axes. The length of a side is a. 

12. Draw a straight line through the point (3, —4) so that the area of the 
triangle it forms with the coordinate axes equals 3. 
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13. Find the slopes of the straight lines x/2 + y/7 = 1 and 9x — Ty + 
1=0. 

14. Draw a straight line through the point (—1, 7) perpendicular to the 
straight line y = (2/5)x + 7. 

15. Form the equations of the legs of an isosceles right triangle if the equa- 
tion of its base is y = 3x + 5 and (4, —1) is the vertex of the Tight angle. 

16. A ray is directed from the point (6, 9) to the straight line 
2x — Sy + 6 = Oat an angle of 45°. Find the equation of a ray reflected from 
this straight line. 

17. Given in an isosceles triangle the equation of its base 2x + y + 3 = 0, 
the equation of one leg x + 4y + 5 = 0, and the point (— 28/5, 6/5) lying 
on the other leg, find the equation of the other leg. 


4.4 
1. Find the relative position of the indicated straight lines: 


(a) 3x- 6y-9=0 (b) 3x- 6y-9=0 


-2x + 4y+6=0 -2x + 4y+5=0 
(c) y= -(5/3)x +2 (d 5x-3y+11=0 
S5Sx+3y-1=0 -4x - 7y + 10- 0 


2. Find the points on the y-axis which are equidistant from the origin and 
from the straight line 4x — 3y + 12 = 0. 

3. Find the point on the straight line x + y = 1 equidistant from the straight 
lines 2x — 9y = 1 and 6x - 7y = 2. 

4. Given the triangle with the vertices A(2, 1), B(—13, 5), C(7, 3), find the 
altitude from C to the side AB. 

5. Find the distance between two straight lines 3x — 4y + 12 = 0 and 
3x- 4y-7=0. 

6. Set up the equations of straight lines which are parallel to the straight 
line 3x — 4y + 12 = O and a distance 7 units away from it. 

7. One side of a square lies on the straight line x — 3y + 1 = 0 ahd one 
of its vertices is at the point (3, 0). Find the equations of its remaining sides. 

8. Through the origin draw tangents to the circle (x + 4)? + (y — 3)? = 16. 


4.7 


1. Check whether the point A(S, 2) and the origin lie on one side or on 
different sides of the indicated straight lines: (a) 7x — 12y + 19 = 0, 
(b) 2x - 9y +3 =0. 

2. Check whether the straight line 5x + 4y — 20 = 0 intersects the line seg- 
ment AB, where A(3, 1) and B(6, —1). 

3. Given the points A(3, 2), B(1, 1), and C(—2, 0), find whether the entire 
triangle ABC lies on one side of the straight line 5x — 3y — 6 = 0. 


Chapter 5 


CONIC SECTIONS 


This chapter deals with second-degree curves or conic sections. 
In rectangular Cartesian coordinates in the plane these curves 
are defined by second-degree algebraic equations whose general 
form is 


Ax? + Bxy + Cy? + Dx + Ey + F=0 


where at least one of the coefficients A, B, C is nonzero. 

The ellipse, hyperbola, and parabola are most interesting 
conic sections and are often used in mathematics and its appli- 
cations. 


5.1. THE ELLIPSE 


1. The definition and canonical equation of an ellipse. 

Definition. An ellipse is the set of all points in a plane such 
that the sum of the distances from two fixed points, called the 
foci, is a constant greater than the distance between the foci. 

A special case of an ellipse when its foci coincide is the circle 
which is the set of all points in the plane a given distance called 
the radius away from a fixed point. 

We can construct an ellipse from its definition (Fig. 74). Take 
a sheet of plywood, and fix two pins at points F; and F2. Then 
make a loop of a thread and put it over both pins. If we pull 
the thread taut with the tip of a pencil and move the pencil 
over the surface of the plywood so that the thread remains taut, 
then the line left behind will be an ellipse with foci F, and Fz. 
By changing the distance FjF? between the foci and the length 
of the thread, we get different ellipses. 


DE 
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Figure 74 Figure 75 


The ellipse is the most common curve in Nature and technol- 
ogy. For instance, the planets orbit the sun along ellipses with 
the sun at one of the foci (Fig. 75), if a circular cylinder is 
cut by a plane not parallel to its axis, then the section is an 
ellipse (Fig. 76). 

The notion of an ellipsoid of inertia is used in mechanics. 
This is an imaginary curve whose shape depends on the shape 
of a plate (Fig. 77). 

Let us derive the equation of an ellipse. Suppose that two 
points Fj and Fz are the foci of an ellipse and M its point. 
We use 2c, rı, and rz to denote the lengths of the line segments 
F\ Fo, FiM, and FM (Fig. 78). 

The sum of the distances Fı M and FM is a constant charac- 
terizing the ellipse. We denote this constant by 2a. By definition 
2a > 2c. Note that if 2a = 2c, then F1M + F;M = 2a is only 


Figure 76 
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Figure 77 


satisfied by the point of the line segment F,F2, while for 
2a « 2c none of the points on the plane satisfy this condition. 
Thus, by the definition of an ellipse, 


mt+m=2a (1) 


Let us introduce a rectangular Cartesian coordinate system in 
the plane so that the x-axis passes through the foci and the 
y-axis divides Fı F> in half (Fig. 79). Then (x, y), (— c, 0) and 
(c, 0) are the coordinates of the points M, Fi, and F> respective- 
ly. Using the formula for the distance between two points we 
can write 


n-2NY(x-c-y, nm-eN(x-cy«y 
Substituting these expressions into (1) yields 
N(x +e e y! e N(x - cy + y! = 2a Q) 


and this is the equation of an ellipse since it is satisfied by the 
coordinates of any point M belonging to the ellipse and is not 


F, 2c Fy 


Figure 78 


Figure 79 
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satisfied by the coordinates of points not belonging to it (for 
such points we have rı + m # 2a). 


Equation (2) can be simplified. We first write it as 
V(x + c + y^ 224- N(x- c + y? 
and square both sides of the last equation: 


(x + c) + y? = 4a? -AaN(x- c)? y! + (x- ey + y 


By collecting like terms we have 


Acx = 4a? — AaN (x — cy, + y’ 
an(x — c} +y = a? -cx 


Then we square both sides of the last equation: 


or 


ax? + ac? — 2a ex + ay? = ad? — 2aex + cx! 
The result of a simple transformation is 


(à? — e + ay? = aua - c) 


By assumption, the difference a? — c? is positive (a > c). We 
use b to denote the square root of this number: b = Va? — c’. 
Thus, b? = a? — c?, b’< a. By dividing both sides of the last 
equation by a?b?, we finally have 

x^ 


2 
y 
= +2 = 1 3 
a 2 (3) 


Sy 


This equation is obtained from (2). Proceeding in the reverse 
way we can show (check this) that equation (2) follows from 
(3). Thus, these equations are equivalent. Equation (3) is called 
the canonical equation of an ellipse. This is a second-degree 
equation and, hence, an ellipse is a second-degree curve. 

Remark. We have noted that in the given coordinate system 
the parameter a in (3) is greater than b. However, equation (3) 
can also define an ellipse when a is smaller than b if we change 
the coordinate axes, thereby reducing the situation to the case 
when a is greater than b. Hence, (3), when a < b, defines an 


NENN E E 
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Figure 80 Figure 81 


ellipse whose foci lie on the y-axis and the distance between 
the origin and the foci is c = Vb? — a’. 

2. Studying the shape of an ellipse from its canonical 
equation. 

If an ellipse is defined by a canonical equation and (x, y) 
is a point on it, then the points (x, —y), (—x, y), and (— x, 
—y) also belong to the ellipse. Indeed, squaring the numbers 
—x and —y yields x? and y and equation (3) is satisfied for 
all the indicated points. Hence it follows that the x- and y-axes 
are the axes of symmetry of an ellipse and the origin is the 
center of symmetry. 

The points at which the ellipse cuts the coordinate axes are 
vertices. Putting y = 0 we have from (3) x?/a? = l,ie.x = +a. 
Thus, the vertices lying on the x-axis have coordinates 
(—a, 0) and (a, 0). Similarly, we find that the coordinates of 
the vertices lying on the y-axis are (0, — 5) and (0, 5) (Fig. 80). 

Since the ellipse is symmetric with respect to the x- and 
y-axes, it is sufficient to study its shape within the first quad- 
rant, i.e. for x > O and y > O. In this case we find from (3) that 


ya? Nai - x 


This relationship shows that as x increases from 0 to a the coor- 
dinate y decreases from b to 0. As x increases further the differ- 
ence a? — x? becomes negative and y does not exist, hence it 
follows that the ellipse has no points with abscissas x greater 
than a. The part of the ellipse we have just considered is depict- 
ed in Fig. 81. We can show that the part of the ellipse lying 
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Figure 82 


Figure 83 


in the first quadrant is convex upward and the tangent at the 
vertex (a, 0) is parallel to the y-axis, while the tangent at the 
vertex (0, b) is parallel to the x-axis. The two tangents are shown 
by dashed lines in Fig. 81. 

The parts of the ellipse in the other quadrants are symmetric 
to the part in the first quadrant with respect to the coordinate 
axes (Fig. 82). 

The center of symmetry of the ellipse is simply called the 
center and it is the midpoint of the line segment F, Fz joining 
its foci. Any chord passing through the center of an ellipse is 
a diameter, Fig. 83 shows several diameters. The diameter pass- 
ing through the foci of an ellipse is called the major axis and 
the diameter which passes through the center and perpendicu- 
lar to the major axis is called the minor axis of the ellipse. 

If, as before, the x-axis passes through the foci F; and F, 
of an ellipse and the origin is the midpoint of the line segment 
FiF;, then the major axis coincides with the x-axis and the 
minor axis with the y-axis; in this case a > b in equation (3) 
of the ellipse since b? = a? — c?. In the case where a < b we 
find that the major and minor axes lie on the y- and x-axes 
respectively. 

The numbers a and b in the canonical equation (3) are the 
lengths of the corresponding line segments which join the 
center of an ellipse with its vertices and are called the semiaxes. 
If a > b, then a is the semimajor axis and b is the semiminor 
axis: if a « b, then b is the semimajor axis and a is the semi- 
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minor axis. In any case it is customary to denote the semiaxis 
lying on the x-axis by a and the other semiaxis by b. 

Example 1. Find the equation of an ellipse if its major axis 
is 10 and the distance between the foci is 8. Assume that the 
major axis coincides with the x-axis and the midpoint of the 
segment joining the foci coincides with the origin. 

O The equation of the ellipse in the given coordinate system 
is its canonical equation (3). Here we have b? = à? - č = 
5? — 4? = 9, hence b = 3. The desired equation is 


S SA a 
Example 2. Show that the curve 9x? + 4y? = 36 is an ellipse. 
Find the lengths of its axes and the coordinates of its foci. 
O Dividing both sides of the equation by the constant term 
yields 


thus the given curve is the ellipse whose semimajor axis is 3 
and the semiminor axis is 2. The foci F; and F; lie on the y-axis 
2c apart, where c? = 3? — 2?, ie. c = V5. The coordinates of 
the foci are (0, — V5) and (0, V5). 

3. The ellipse as a section of a circular cylinder. 

We show that any section of a circular cylinder by a plane 
not parallel to its axis is an ellipse. 

o Consider a circular cylinder and a plane œ intersecting 
it at a point O and use / to denote the curve of intersection. 

Draw a plane & through O perpendicular to the axis of the 
cylinder; the section is the circle i with radius r. We choose the 
coordinate axes in each plane o and à as shown in Fig. 84 (the 
x-axis is common to both planes and the axes of ordinates are 
the y-axis in the o plane and the j-axis in the à plane). 

Let M(x, y) be an arbitrary point on the curve /. Its projec- 
tion on the plane & is the point M on the circle /. Let x and 
j be the coordinates of M in the plane à. We can see from 
the figure that f = y cos e, where e is an (acute) angle between 
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Figure 84 Figure 85 


the planes a and à. Since M(x, ¥) lies on the circle with radius 
r and center at the origin, we have x? + 5? = r?, whence 


x! + y cos? g = r? 


Thus, the coordinates of any point M(x, y) of the curve / are 
related as follows 


+—¥ 21 (4) 


r? r 
COS e 


Conversely, if point M(x, y) in the plane o satisfies (4), then 
its projection M(x, y), where ¥ = fcos e, satisfies equation 
x? + y? = r° ie. lies on the circle /, hence the point M lies on 
the curve /. 

Thus, the curve of intersection of a cylinder and a plane is 
defined by equation (4), that is, is an ellipse with the semiminor 
axis r and the semimajor axis r/cos e, where r is the radius of 
the section of the cylinder and ¢ is the angle of inclination 
of the cutting plane to the perpendicular plane to the axis of 
the cylinder. m 

4. The equation of a tangent to an ellipse. 

Suppose an ellipse is specified by its equation in standard 
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form 

Jy 

T5221 5 
IU (5) 


We choose a point Mo(Xo, yo) on the ellipse, where yo # 0 (that 
is, a point which is distinct from the vertices A; and A2 (Fig. 
85)). A small portion of the ellipse near Mo is the graph of 
a function y = y(x) (we can explicitly express y in terms of x 
using (5)). Differentiating both sides of the equation 


x 3^) 
T 1 
a? b? 
with respect to x yields 
2x , 2yy 
T 1 
a’ b? 
whence it follows that 
, b? x 
y TET = c 


This means, for one, that the tangent at Mo has the slope 
2 
ice 2 Ao 
a” yo 


If we know the slope of a tangent and a point Mo(Xo, yo) 
on the tangent, then we can write the equation of the tangent 
as y — yo = k(x — xo) or 


2 
y — yo 7 — = — (X — xX) 
a yo 
Transposing the terms to the left-hand side yields 
2 
Xox 4 Yo _ (3 2 z) -0 
a b a 
or, since Mo lies on the ellipse, 


XoX Joy 
Far (6) 
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which is the desired equation of the tangent to an ellipse at 
a point Mo(xo, yo). Equation (6) only differs from equation (5) 
of an ellipse in that (6) is a linear equation in the running coor- 
dinates x and y. 

Now, let us take into consideration the vertices A; and A> 
of the ellipse. Suppose, for instance, that Mo coincides with A1, 
then we have xo = —a, yo = 0, and (6) assumes the form x = a, 
which is, in fact, the equation of the tangent at A1. Similarly, 
if Mo coincides with A2, then (6) transforms into the equation 
of the tangent at A>. 

Thus, the tangent to an ellipse defined by (5) at a point 
Mo(xo, yo) of the ellipse is defined by (6). 

Example. Find the equation of the tangent at a point Mo 
of the ellipse 


the tangent lies on the bisector of the first quadrant angle. 
© We first find the coordinates of Mo. Since xo = yo for 
ae 


this point, we have i16 * 9 


1 from the equation of the el- 


. : . 16-9 43 12 
lipse, i.e. x = Ss = 
Xo > 0). According to (6) the equation of the tangent to the 


ellipse at the point (? ; 2) has the form 


, whence xo = (bear in mind that 


5.2. THE HYPERBOLA 


1. The definition and canonical equation of a hyperbola. 

Definition. The hyperbola is the set of all points in the plane 
such that the absolute value of the difference of the distances 
from two fixed points called foci is a constant smaller than the 
distance between the foci. 

We use F; and F2 to denote the foci of a hyperbola and 2c 
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to denote the distance between the foci. Let M be an arbitrary 
point on the hyperbola and rı and rz the lengths of the segments 
FiM and FM. By the definition of a hyperbola, 


Iri — rl = 2a (1) 


where a is a constant characterizing the hyperbola (2a < 2c). 
In order to derive the equation of a hyperbola, we introduce 
a rectangular Cartesian coordinate system so that the X-axis 
passes through the foci and the y-axis divides the line segment 
FF; in half (Fig. 86). Then the coordinates of the points Fi, 
F;, and M are (—c, 0), (c, 0), and (x, y), respectively. By ex- 
pressing rı and rz in terms of the coordinates of Fi, F2, and 
M and substituting the results into (1), we obtain 


I (x + oy + Y? - N(x - ey) + y!l = 2a Q) 


Perform similar transformations as in the case of the ellipse. 
Write equation (2) in the form 


Vato + y) - N(x - cy + y* = £24 


transfer the second root to the right-hand side, and square both 
sides. The result is 


(x+ c}? + y? = Ad? + 4a Qo +r Hae Hy 
or RECEN 
cx — @ = tav(x— ey + y? 
Squaring both sides of the last equation yields 


c) — ta2ex + a^ = ax? — 2d) ex ad) + ay’ 


M(x,y) 


[0] 


F(-c, 0) F,(c, 0) 


Figure 86 
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or 
(c — a*)x? - dy? = a*(c* - a?) (2a) 


We introduce a new number b = Vc? — a’. Since c > a by 
definition, we have c? — a? > 0 and the number b exists; with 
b? = c? — a’ or, which is the same, c? = a” + b’. By dividing 
both sides of (2a) by a?b? we finally have 


e --l G) 


Equation (3) is derived from equation (2). If we perform trans- 
formations in reverse direction, then we can show that (2) fol- 
lows from (3) as well. Thus, these equations are equivalent. An 
equation of form (3) is called the canonical equation of a 
hyperbola. 

This is the second-degree equation and thus the hyperbola 
is a second-degree curve. 

Example 1. Find the foci of the hyperbola 

n em 


9 16 


O We have a = V9 = 3 and b = V16 = 4. The distance be- 
tween the foci is 2c, where c? = a? + b? = 9 + 16 = 25, ie. 
c = 5. The coordinates of the foci are (— 5, 0) and (5, 0). 6 

Example 2. Show that the curve 


ae 


A e ine] 
a b 
is a hyperbola and find its foci. 

O The given equation only differs from the canonical equa- 
tion in that its right-hand side is — 1 and not 1. We introduce 
a new system of coordinates, namely, we consider the x-axis 
to be the new ordinate denoted by Y-axis and the y-axis to be 
the new abscissa denoted by X-axis. In new coordinates the 

2 2 
given equation takes the form 2 — A = -l or 
a 
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This equation defines a hyperbola whose foci lie on the X-axis 
at the points (— c, 0) and (c, 0), where c = Va? + b^. In the 
initial xy coordinates the foci lie on the y-axis at the points 
(0, —c) and (0, c). @ 

2. Studying the shape of a hyperbola from its canonical 
equation. 

Since the power of each variable x and y in equation (3) is 
two, the hyperbola defined by (3) is symmetric with respect to 
the coordinate axes and the origin O is the center of symmetry 
of the hyperbola. Thus, to determine the shape of a hyperbola 
it is sufficient to consider its portion lying in the first quadrant. 

We first determine whether a hyperbola intersects the axes 


of symmetry. It follows from (3) that |x! > a since for |x| < a 


y! 


the expression > yi is less than unity. Thus, a hyperbola 
does not intersect the y-axis which is called the conjugate axis 
of the hyperbola. At the same time putting y = 0 we find that 
X = +a, which means that the hyperbola and the x-axis inter- 
sect at two points, A,(—a, 0) and A2(a, 0). These points are 
called the vertices and the x-axis the transverse axis of the 
hyperbola. 

Thus, the axis of symmetry which does not intersect the 
hyperbola is its conjugate axis and the axis of symmetry which 
intersects it at two points (its vertices) is its transverse axis. 

Assuming x > 0, y > 0 in the equation of a hyperbola we 
express y in terms of x 


2b EE 
pa x!-a 


We can see that as the coordinate x increases y also increases 
indefinitely. Figure 87 depicts the part of the hyperbola in the 
first quadrant. We can show that this part is convex upward 
and the tangent (drawn with a dashed line) to the hyperbola 
at the vertex Az is parallel to the y-axis. 

Since a hyperbola is symmetric with respect to the coordinate 
axes, it can be drawn as illustrated in Fig. 88. The hyperbola 
consists of two parts called branches, one of which corresponds 


144 Part One. Analytic Geometry 


A,(-a,0) 0 A,(a , 0) x 
Figure 87 


to x 2 a and the other to x < —a. The dashed lines in the 
figure are explained in the next subsection. 

3. The asymptotes of a hyperbola. 

Let us analyze the straight lines y — D x and y= — 2x 
which are shown by dashed lines in Fig. 88 and denoted by 
l+ and /_ in Fig. 89. These lines possess a remarkable property, 
namely, if a point M moves along a hyperbola receding in- 
definitely from the origin, then the hyperbola approaches 
closer and closer to one of the indicated straight lines. 

In order to verify this property we choose the point M in 
the first quadrant (it is sufficient to consider only this case since 
a hyperbola is symmetric with respect to the coordinate axes). 
Let M move along a hyperbola receding indefinitely from the 
origin O, with x > œ. The distance d = MP from M to the 
straight line /, is less than the distance MN (MP is the perpen- 


Figure 88 
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Figure 89 


dicular to /,). But MN is the difference of the ordinates of 
the hyperbola and the straight line /, . Since the branch being 


considered is described by the equation y = b. Vx? — a* and 


the line /, by y = P x, we have 
MN - P. (y - A? Ta) 


Multiplying and dividing this equation by x + Vx? — a? yields 


As the point M moves along the hyperbola receding in- 
definitely from the origin, its abscissa x becomes as large 
as desired (x > œ). We can say the same of the value x + 


Nx* — a°. Therefore, the fraction a becomes ar- 
x -a 
bitrarily small (MN —> 0), consequently, the distance d becomes 
arbitrarily small as well. 
Suppose for a curve L there exists a straight line / such that 
when a point M moves along L to infinity, this point ap- 
proaches / indefinitely, then the straight line / is called an 


10—366 
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asymptote of the curve L. Thus, the straight lines y = 5. x and 


y=- Dx are asymptotes of the hyperbola. 


Remark. In order to draw the hyperbola defined by equation 
(3) we construct a rectangle with center at the origin and sides 
2a and 2b, which are parallel to the x- and y-axes respectively 
(Fig. 90). The straight lines joining opposite vertices of the rec- 
tangle are the asymptotes of the hyperbola. Then sketch the 
branches of the hyperbola: the left branch should touch the 
rectangle from outside at the point A: (one vertex of the hyper- 
bola) and approach the asymptotes with its “ends” and the 
right branch touches the rectangle at the point 42 (the other 
vertex) from the outside and also approaches the asymptotes. 

The line segments a and b are semitransverse and semicon- 
jugate axes respectively. 

Example 1. Form the equations of the asymptotes of the 
hyperbola 


O We have a — 3 and b - 2, the equations of the asymp- 
totes are y — x and y= — x. e 


Example 2. Set up the equation of the hyperbola whose 
transverse axis is the x-axis, the equations of whose asymptotes 


Figure 90 
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are 5x — 4y = 0 and 5x + 4y = 0, and the distance between 
the vertices is 16. 

© We know that the asymptotes intersect at the center of 
a hyperbola. We infer from the equations of the asymptotes 
that the center of a hyperbola is at the origin. Thus, the x-axis 
is the transverse axis of the hyperbola (by definition) and the 
y-axis is its conjugate axis. The equation of the hyperbola has 
the form 


E cy eq 
g b 
In order to find the numbers a and b we reduce the equations 
of the asymptotes to the form y — i x and y= — ee 


whence we immediately have 5. = i But, by hypothesis, 
2a = 16, ie. a = 8. Consequently, b = 10. 
The desired equation has the form 


x uy 
3 90779 


4. The equation of a tangent to a hyperbola. 
The equation of a tangent to the hyperbola 


xr -S-1 (4) 


can be found in the same way as for an ellipse. Therefore, we 
just give the final result without repeating the argument. 
The tangent to the hyperbola defined by (4) at a point Mo(xo, 
Yo) on this hyperbola is determined by the equation 
WX mat cd 
e d 
5. The hyperbola as a graph of inverse proportionality. 
If in the equation of the hyperbola 
xy : 


P b? 


10* 
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y 


Figure 91 


the numbers a and b are equal, we can write 
x-yzg4 (5) 


A hyperbola defined by (5) is equilateral and its asymptotes 
are the straight lines y = x and y = —x, i.e. they are the bisec- 
tors of the quadrant angles. 

Let us turn the coordinate axes through 45? and use X and 
¥ to denote the new axes so that they coincide with the asymp- 
totes of the equilateral hyperbola (Fig. 91). We can show that 
the equation of the hyperbola in the new coordinates has the 
form 

gala 
2 

We know that two varying quantities X and j related as 
Xf = c, where c is a constant, are inversely proportional. Thus, 
we infer that an equilateral hyperbola is the graph of inverse 
proportionality. 


5.3. THE PARABOLA 


1. The canonical equation of a parabola. 

Definition. A parabola is the set of all points in the plane 
equidistant from a fixed point, called the focus, and a fixed 
line, called the directrix. 
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We use p to denote the distance between the focus and the 
directrix, the number p > 0 is the parameter of the parabola. 

In order to derive the equation of a parabola, we take the 
straight line which is perpendicular to the directrix and passes 
through the focus F as the x-axis and assume that it is directed 
from the directrix to the focus (Fig. 92). We place the origin 
at the midpoint of the line segment joining the focus and the 
directrix. The coordinates of the focus in our system are p/2, 
0 and the equation of the directrix is x = —p/2. 

Suppose M(x, y) is an arbitrary point on the parabola and 
P is its projection on the directrix, then, by the definition of 
a parabola, MP = MF or 


B pt 
= («-£) ay. (1) 


Squaring both sides of (1) and simplifying yields 
y! = 2px (2) 


It can easily be shown (we leave it to the reader) that the 
converse is also true, namely, we can derive (1) from (2). Thus, 
(1) and (2) are equivalent. Hence it follows that (2) is the equa- 
tion of the parabola. 

Equation (2) is the canonical equation of a parabola. We can 
see from this equation that a parabola is a second-degree curve. 

It also follows from the canonical equation that the parabola 
defined by it is symmetric with respect to the x-axis. In fact, 


p 
Xn 


Figure 92 
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if (2) is satisfied by the coordinates of point (x, y), then it is 
also satisfied by the coordinates of point (x, —y). In addition, 
all the points on the parabola have x > 0. The point (0, 0) is 
the leftmost point of the parabola (in the usual coordinate Sys- 
tem) and called the vertex of the parabola. 

We note n more that if a parabola is defined by the canon- 
ical equation y? = 2px in the given coordinate system, then its 
directrix is defined by the equation x = —p/2 and its focus 
is at the point F(p/2, 0). 

2. The parabola as the graph of a quadratic equation. 

School mathematics call the graph of the function 


y=ar+bxic 


a parabola, where a is nonzero. Let us ascertain that this defini- 
tion is consistent with that given earlier. 
We transform the equation as 


b x .b hi 
y= «(e +2 X+- EGRE zi (2) 
bN c = b ? b- 4ac 
E (2) +5) =a(x+ 2) da 


Introducing the notation 


"n ON: 
2a i 4a 


=6 
we can write 
y - B=a(x- ay 


Now, if we assume the point O(a, 8) to be the new origin and 
leave the direction of the coordinate axes unchanged, then the 
old and new coordinates are related as 


X-x-ajy-y-B 
and the equation of the curve we consider in the new system is 
j-aX or # = 2py 


with p = 1/2a. The last equation only differs from the canoni- 
cal equation of a parabola in the notation of coordinates. 
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Thus, a parabola is, in fact, the graph of a quadratic poly- 
nomial. 

Example. Find the directrix and focus of the parabola de- 
fined by the equation 


zl cU 
=5 x!- 3x46 
O We transform the equation as follows 
1 


PA (2 - 6x + elg - 6x + 9-9 + 12) 
(x -3y +3 


or 


232-1 231 
Then we translate the coordinate axes assuming the point Ó(3, 
3/2) to be the origin. The equation of a parabola in the new 
coordinate system has the form 


por? or č = py 


with p = 1. This equation only differs from the canonical equa- 
tion (2) in the notation of the axes. Noting the remark made 
at the end of subsection 1, we find that the equation of the 
directrix of the parabola is y = —p/2 = —1/2 and the focus 
is at the point with the coordinates f = p/2 = 1/2, X = 0. Pass- 
ing to the old coordinates via the formulas x = X + 3 and 
y = f + 3/2 yields the equation of the directrix, viz. y = 1, and 
the coordinates of the focus are x = 3 and y = 2 (Fig. 93). @ 

3. Optical properties of the parabola. 

One property of the parabola is widely applied in optics. We 
first formulate this property geometrically. 

Figure 94 shows the parabola y? = 2px with focus F, a point 
Mo(Xo, yo) on the parabola, the ray / emanating from Mo and 
parallel to the axis of the parabola (the x-axis), and the tangent 
to the parabola at Mo. The property we are interested in is that 
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Figure 93 Figure 94 


the tangent makes equal angles with the vector MoF and the 
ray l, 

This statement is obvious if Mo is the vertex of the parabola, 
i.e. Mo(0, 0). Suppose, therefore, that Mo is not at the point 
(0, 0). In this case a sufficiently small part of the parabola near 
Mo is described by the equation y = y(x). Differentiating the 
identity y^(x) = 2px with respect to x yields 2yy’ = 2p, whence 
y’ = p/y. This means, for one, that the slope of the tangent 
to the parabola at Mo is k = p/yo. 

It follows directly from Fig. 94 that 


Jo 


tano o cp 


where ¢ is the angle the vector FMo forms with the x-axis. We 
have at the same time 


tan2a = 2tna ..  2p/Yo , 2pyo 
l -tana 1l-—p*/y ye-p 
_ 2py _ Jo 


2pxo- p? Xo- p/2 
whence it follows that y = 2a, and since o = « + f, we have 
a = B which was desired to obtain. 
The property of a parabola we proved can be interpreted as 
follows. Suppose a source of light is placed at the focus F Light 
rays emanating from the focus are reflected from the parabola 
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Figure 95 


governed by the law of reflection, namely, the angle of inci- 
dence is equal to the angle of reflection. Since the angles a 
and f are equal (which we have just proved), the reflected ray 
is parallel to the x-axis. In other words, all the rays are reflected 
in a beam parallel to the axis. 

The reflection property is exploited in constructing parabolic 
mirrors which are used in telescopes, searchlights, automobile 
headlights, solar heating devices, and the like (Fig. 95). 


Exercises to Chapter 5 
5.1 


1. A vertex of a triangle whose base is fixed moves so that the perimeter 
of the triangle is constant. Find the trajectory of the moving vertex if the base 
of the triangle is 24 and the perimeter is 50. 

2. A line segment of constant length slides with its ends along the sides of 
a right angle. Prove that any point of the segment moves along the arc of an 
ellipse. 

3. Form the canonical equation of an ellipse if the distance between the foci 
is 6 and the semimajor axis is 5. 


4, Find a point on the ellipse x + y = 1 at a distance 4 from the minor 
aris: 20 15 

5. Given the semiaxes a and b of an ellipse with a > b, prove that the distance 
r from an arbitrary point on the ellipse to its center is such that b < r < a. 


6. Find the side of a square inscribed in the ellipse : + p el. 


7. Given an ellipse passing through the points (8, 3) and (2V5, 2v5), find 
its equation if we know that the coordinate axes are its axes of symmetry. 


8. Find the equation of the tangent to the ellipse F + n = | at the point 
(1, 3). 
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y 


x ; 
9. Find the tangents to the ellipse m + 5 = 1 with the slope k = —1 and 
determine the distance between these tangents. 
10. Prove that the straight line Ax + By + C = 0 touches the ellipse if and 
only if Aa? + Bb? = C. 
11. Find the tangents common to the two ellipses 


wl 
v| 


2 2 
dc ea +=] 
3 3 


12. Different chords are drawn through the vertex (a, 0) of the ellipse 
xt yt ; : 
ati x = ]. Find the line on which the midpoints of the chords lie. 
a 


13. Prove that the tangent to an ellipse at a point M forms equal angles 
with the line segments MF; and MF, where Fi and F, are the foci. 

14. A cylinder whose section is a circle of radius 10 is cut by a plane which 
makes an angle of 30? with the axis of the cylinder. The curve of the intersec- 
tion is the ‘ellipse: find the lengths of its semiaxes. 


52 


1. Form the equation of a hyperbola whose axes coincide with the coordinate 
axes if the distance between the foci is 20 and the distance between the vertices 
is 16. 

2. Given the foci Fi(—5, 0) and F> (5, 0) and the point C(4V2, 3) on the 


‘hyperbola, find its equation. 
2 


3. Find a point M on the hyperbola as - A = 1 for which MF, 1 MF:, 


where F, and F are the foci. 
2 
4. Find the foci and asymptotes of the hyperbola T —----] 


5. Find the equation of the hyperbola given the equation of its asymptotes 
y = +(3/2)x and the point (2V3, 3) on it. 
6. Prove that the product of the distances from any point on a hyperbola 
to its asymptotes is a constant. : 
7. Find the points at which the hyperbola x EE 
90 36 
ing straight lines: (a) x - Sy = 0, (b) 2x + y — 18 = 0, (c) x E +5=0. 


1 intersects the follow- 


8. Find the equation of the tangent to the hyperbola a - a = ] at the 


point (6, —5). x 2 
9. Draw the tangent with the slope k = 2 to the hyperbola a — — x]. 


10. Find whether we can draw a tangent to a hyperbola at any angle of incli- 
nation to the transverse axis. 


Ch. 5. Conic Sections 155 


11. Find the condition for the straight line Ax + By + C = 0 to touch the 
2 
hyperbola x E 1. 
d^ bt 
53 


1. Form the equation of the parabola touching the y-axis if its focus is at 
the point (6, 0) and the x-axis is its axis of symmetry. 

2. Form the equation of the parabola 

(a) symmetric with respect to the x-axis, touching the y-axis, and passing 
through the point (9, 1), 

(b) symmetric with respect to the y-axis touching the x-axis, and passing 
through the point (3, —3). 

3. Find the points at which the parabola y? = 3x and the following straight 
lines intersect: 

(a) x —3y + 6=0, (b) x- 29 +3=0, (c+) y+ x= -S. 

4. Find the points at which the parabola y? = 12x and the ellipse 
è y 
I ae 
25 16 

5. A chord which is perpendicular to the axis of the parabola y? = 2px is 
drawn through its focus. Find the length of the chord. 

6. Prove that the tangent to the parabola y? — 2px at the point (xo, yo) is 
defined by the equation yo = p(xo + x). 

7. Form the equation of the tangent to the parabola y? — 2px at a point 
with abscissa 3. 

8. Draw the tangent to the parabola x? — 5y parallel to the straight line 
2x - y 20. 

:3 Find the condition that the straight line y = kx + b touches the parabola 
y = 2px. 

10. Find the equation of the line formed by the midpoints of the ordinates 
of the parabola y? = 2px. 

11. An object thrown at an angle to the horizon describes an arc of a parabo- 
la and falls 16 meters away. Find the parameter p of the parabolic trajectory 
if the height reached by the object is 12 meters. 

12. Find the equation of the parabola if its minimal ordinate is — 4, the 
focus is at the point (4, 0), and the axis of symmetry is parallel to the y-axis. 


= 1 intersect. 


Chapter 6 


THE PLANE IN SPACE 


6.1. THE EQUATION OF A SURFACE IN SPACE. 
THE EQUATION OF A PLANE 


1. The notion of the equation of a plane. 
Suppose we have an equation in three variables 


F(x, y, 2)20 (1) 


We introduce a rectangular Cartesian coordinate system in 
space and consider x, y, z to be the coordinates of a point. 
Equation (1) is satisfied for some points (x, y, z), while for 
others it is not. The question arises as what is the set T of the 
spatial points that satisfy equation (1). Generally, the set T is 
a surface in space. Let us consider the case where we can express 
coordinate z from (1) as a function of x and y 


z = f(x, y) 


When the values x = xo, y = yo are fixed, we have a single value 
Zo = f (Xo, Yo) for z. Geometrically, this means that exactly one 
point from the set T lies on each straight line parallel to the 
z-axis. In other words, the set T and every straight line parallel 
to the z-axis intersect at a unique point. It is natural to consider 
such a set a surface in space. 

Definition 1. The surface defined by the equation F(x, y, 
z) = O is the set T of all points in space whose coordinates satis- 
fy this equation. 

Definition 2. The equation of a surface T given in space is 
the equation F(x, y, z) = 0 such that the surface defined by 
this equation coincides with T. 

As an example, we find the equation of a sphere centered 
at the point P(a, b, c) with radius R. 

The point M(x, y, z) belongs to the sphere we consider if 
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| a 


Figure 96 Figure 97 


and only if IPMI =R or 
V(x—ay + ( — by + (z— cP =R 


This relationship is the equation of the sphere. Squaring both 
sides we finally obtain 


(x — a} + (y — b)? + (2 - c)? = R? 


2. The equation of a cylindrical surface. 

Suppose we have a curve L and a straight line / in space. 
If we draw a straight line parallel to /, through every point of 
the curve L, then we obtain a surface (Fig. 96) called cylindri- 
cal. The curve L is the directrix and the lines parallel to / consti- 
tute the generatrix of the cylindrical surface. 

We choose a coordinate system in space so that the z-axis 
is parallel to the straight line / and assume that the directrix 
L is in the xy-plane and is defined by the equation 


F(x, y) = 0 (2) 


We now prove that (2) is also the equation of the given cylindri- 
cal surface in space. 

A point M(x, y, z) belongs to this surface if and only if its 
projection on the xy-plane, a point N(x, y, 0), lies on the curve 
L (Fig. 97), and this means that x and y satisfy (2). Thus, M 
is a point of the cylindrical surface if and only if its x, y coor- 
dinates satisfy (2). 
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Figure 98 Figure 99 


Similarly, we can establish that a cylindrical surface with 
generatrices parallel to the y-axis is defined by an equation of 
the form ®(x, z) = 0. If generatrices are parallel to the x-axis, 
then the equation of the surface is of the form ¥(y, z) = 0. 


Example 1. The equation y — x = 0, which does not contain 
the z coordinate, defines a cylindrical surface with generatrices 
parallel to the z-axis. The bisector of the first and third quad- 
rant angles in the xy-plane is the directrix (Fig. 98). 

Example 2. The equation (y — 1)? + (z + 2)? = 1, which 
does not involve the x coordinate, defines a cylindrical surface 
with generatrices parallel to the x-axis. The directrix is the circle 
centered at the point (0, 1, —2) with radius | and lying in the 
yz-plane. This surface is called the right circular cylinder 
(Fig. 99). 

3. Different ways of defining a plane. The equation of a plane 
passing through a given point and perpendicular to a given 
vector. 

A plane in space can be specified in several ways. For instance 
we can specify 

(a) a point Mo of the desired plane and a nonzero normal 
vector n to the plane, 

(b) a point Mo of the desired plane and two noncollinear vec- 
tors parallel to the plane, 
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M(x,y,z) 


Mo(Xo: Yo» Zo) 


Figure 100 


(c) three points of the desired plane Mi, M2, and Mi (it is 
assumed that the points are not collinear). 

We introduce a rectangular Cartesian coordinate system in 
space. Given a point Mo(xo, yo, zo) and a nonzero vector 
n — (A, B, C) in this system, find the equation of a plane 
a through Mo and perpendicular to n. 

Tlie point M(x, y, z) lies on the plane o if and only if the 
vectors MoM and n are perpendicular (Fig. 100), i.e. 


MoM-n = 0 (3) 


Since MoM = (x — Xo, Y — Yo, Z— Zo) and n = (A, B, C», 
condition (3) is equivalent to the equation 
A(x — xo) + B(y — yo) + C(z — zo) = 0 (4) 
which is the equation of the plane a. 
Thus, (4) is the equation of the plane through Mo(Xo, yo, 
zo) and perpendicular to n = (A, B, C). 


Example. Write the equation of the plane through the point 
Mo(1, 2, —3) and perpendicular to the vector n = (4, 7, 9): 


4(x — 1) + 7(y — 2) + 92+ 3) =0 
or, after simplifying, 
4x+7y+9z+9=0 


Theorem. Any plane in rectangular Cartesian coordinates is 
defined by the linear algebraic equation, that is, the equation 
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of the form 
Ax + By + Cz+D=0 (5) 


at least one of the coefficients A, B, C being nonzero. 

Conversely, any linear equation in which at least one of the 
coefficients A, B, C is nonzero defines a plane. 

n Given a plane œ in space. We consider a point Mo(xo, 
Yo, Zo) lying on the plane and a nonzero normal vector n = ( A, 
B, C) to the plane. According to what we have proved before, 
the equation of the plane a has the form in (4). We write (4) 
as Ax + By + Cz + D — 0, where D = -Axo — Byo — Czo, 
and see that the plane o is defined by a linear equation; since 
n # 0, one of the three numbers is nonzero. 

Conversely given equation (5), where one of the numbers 
A, B, C is nonzero, choose one of the solutions x — xo, y — yo, 
Z = Zo of this equation. For instance, if A » 0, then we can 
take the solution xo = —D/A, yo = 0, zo = 0. Then subtracting 
the identity 


Axo + By + Ca + D- 0 (6) 
from equation (5) yields 
A(x — xo) + BO — yo) + C(z — zo) = 0 (7) 


which is equivalent to the original ec uation (5). In fact, we ob- 
tained (7) from (5) by subtracting (6) from it; similarly, we can 
derive (5) from (7) by adding (6) to it. 

Accordingly equation (7) defines the plane through the point 
Mo(Xo, o, Zo) and perpendicular to the vector n = (A, B, C}, 
hence it follows that (5) also defines the plane. w 

Definition. An equation of the form in (5) where at least 
one of the coefficients 4, B, C is nonzero is called the general 
equation of the plane. 

Note that any nonzero vector n perpendicular to the given 
plane is called a normal vector to this plane. Obviously, a plane 
has a set of normal vectors, all of them being mutually col- 
linear. 

Let us concentrate on a fact we established when proving the 
second statement of the theorem, namely, if a plane is defined 
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Figure 101 


by the equation Ax + By + Cz + D=0, then n = (A, B, C) 
is a normal vector to the plane. 

4. Analyzing the general equation of a plane. 

Let us find whether a change in the coefficients A, B, C and 
in the number D in the general equation of a plane 


Ax + By + Cz+D=0 (8) 


changes the position of the plane with respect to the coordinate 
axes. 

(1) If D changes while the coefficients A, B, C do not, then 
the normal vector n = (A, B, C) is left unchanged: the plane 
is displaced parallel to itself in space (Fig. 101). 

For D = 0 the equation assumes the form 


Ax + By + Cz 0 (9) 


Obviously, the numbers x = 0, y = 0, z = 0 satisfy (9), hence 
it follows that (9) defines a plane passing through the origin. 

(2) A change in A, B, C makes the vector n rotate and, conse- 
quently, the plane itself. If one of the coefficients vanishes, 
then n becomes perpendicular to the respective coordinate axis, 
and therefore the plane is parallel to that axis. For instance, 
the equation By + Cz + D = 0 defines a plane parallel to the 
x-axis. 

If one of the coefficients and the constant term are zero, then 
the plane passes through the corresponding coordinate axis. 
For instance, the equation By + Cz = 0 defines a plane passing 
through the x-axis. 


11—366 
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(3) If two coordinates of the vector n are both zero, then 
n is parallel to that coordinate axis the projection on which 
is nonzero; the plane is perpendicular to this axis or, in other 
words, is parallel to the corresponding coordinate plane. 

Thus, the planes defined by Ax + D = 0, By + D = 0, and 
Cz + D = 0 are perpendicular to the x-, y-, and z-axes, respec- 
tively or, which is the same, are parallel to the yz, Xz, and xy 
coordinate planes, respectively. 

For D = 0 the last three equations assume the form x = 0, 
y = 0, and z = 0. These are the equations of the yz, xz, and 
Xy coordinate planes, respectively. 

5. Relative position of two planes. 

Suppose two planes a and @ are defined by the equations 


Aix + Biy c Cix  D-0 for a 
Ax + Boy + Caz + D 0 for f 


Three cases are possible here: (1) the planes coincide, (2) the 
planes are parallel but not coincident, and (3) the planes are 
not parallel, i. they intersect along a straight line. 

How can we determine from the equations which is the case? 
Case 3 can easily be established, namely for two planes not 
to be parallel it is necessary and sufficient that their normal 
vectors n; = (Ai, Bi, Ci) and m = (A45, B2, C2) be not 
parallel, i.e. the coordinates of n, be not proportional to those 
of m. For instance, the planes defined by 3x - y -z - 4-0 
and x — y + 2z — 5 = 0 are not parallel since the numbers 3, 
—1, 1 are not proportional to the numbers 1, —1, 2. 

Cases 1 and 2 are common in that m and nm; are collinear, 
ie. n; = Mni, where ^ is a number. The equations of the planes 
are 

Aix + By - Ciz D 20 for a 


Aix + ABiy +dCiz+ D2=0 for B 
If here the planes have at least one point in common, then 


the last two equations have the general solution xo, yo, zo. By 
substituting these coordinates into both equations, we obtain 
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two arithmetic identities. Multiplying the first by A and sub- 
tracting the result from the second yields D; — \D, = 0. Con- 
versely, if n? = An; and D: = AD, then the planes coincide. 

Thus, case 1 is characterized by the existence of a number 
A such that A2 = MA1, B2 = AB, C4 = AC), D: = AD, while 
for case 2 À is such that A2 = MA1, B; = XBi, Co ¥ AC), 
D; # XD. 

Example. The planes 2x + 2y — 4z + 6 = 0 and 3x + 3y — 
6z + 10 = 0 are parallel but do not coincide since 2/3 = 
2/3 = -4/(—6) = 6/10. 


6.2. SPECIAL FORMS OF THE EQUATION OF A PLANE 


1. The equation of a plane passing through a given point 
and parallel to two given vectors. 

Given a point Mi(xi, Yi, zi) and two noncollinear vectors 
a-(p,q,r»anda' = (p',q',r'»inarectangular Cartesian 
coordinate system. We assume that the vectors originate at Mı 
(Fig. 102). Let us form the equation of the plane a through 
Mı and parallel to a and a’. E 

If a point M(x, y, z) lies in the plane o then the vectors Mi M, 
a, and a’ lie in œ and hence are coplanar. Conversely, if M 
is such that the three vectors are coplanar, then M lies in the 
plane of a and a’, ie. in the plane a. 

The condition that three vectors are coplanar is, according 
to the theorem in Sec. 3.10, that the determinant composed of 
the coordinates of these vectors is zero. This condition has the 


Figure 102 
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p q F =0 (1) 


for the vectors MM, a, and a’. 

Consequently, (1) is the equation of a plane through the point 
Mia, Yı, z1) and parallel to the vectors a = (p, q, r} and 
a’ = (p, q', x 

Example 1. Find the equation of the plane through the point 
M,(2, 2, 1) and parallel to the vectors a = (3, 2, 5» and 
a’ = (1, —1, 0). 

O Substituting the data into (1) yields 


x-2 y-2 4-1 
3 2 5 =0 
1 -1 0 
To simplify this determinant we expand it by the elements of 
the first row: 


2.5 


(x —- 2) -10 


3 5 3 2 
[oof ient ae 
or 

S(x — 2) + S(y — 2) - 5(z - 1)= 0 


By removing brackets and dividing both sides by 5, we obtain 
the desired equation x + y- z — 3 = 0. e 

Example 2. Find the equation of the plane o through the 
point Mi(1l, —1, —4) and perpendicular to each plane defined 
by 3x — 2y + 3z + 5 = 0 and -5x + dy- z 41-20. 

O Since each of the given planes is perpendicular to the 
plane o its normal vector must be parallel to o. Then we have 
two vectors a = (3, —2, 3) anda’ = (—5, 4, —1). According 
to (1) the equation of o has the form 


x-1 y+1 z+4 
3 -2 3 =0 
-5 4 -1 
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Expanding the determinant yields 


emn] Se o«»|. sei peines ena 


or 
-10x — 1) - 14 + D + XZ + 4) = 


and the final result is the desired equation 5x + 6y — z — 3 = 
0. @ 

2. The equation of the plane passing through three given points. 

Given three noncollinear points Mi(xi, yi, zi), M2(%2, Y2, 22), 
and M3(x3, Y3, Z3) find the equation of the plane passing through 
these points. 

. Consider the vectors MM = (X2 — 1, »2 — Yi, Z2 — zi) and 
MiM; = (x3 — Xi, Y3 — Jı, Z3 — zi). which are, by assumption, 
noncollinear and lie in the desired plane œ (thereby are parallel 
to a). Thus, we can obtain the equation of the plane o as that 
of th the plane through the point Mı and parallel to the two vectors 
MiM and MM, ie. 


x= Op Bez 
Xx—-Xx »-xn 2-z|^70 Q) 
X3—X1 -J BZ- 


Thus, (2) is the equation of the plane through three points 
Mı, M2, M3, or the three-point form of the equation. 
Example. Find the equation of the plane passing through the 
three points Mi(1l, —2, —1), M2(2, 3, 0), and M3(6, 2, —2). 
O Substituting the coordinates of the points into (2) yields 
x—-1 y+2 zl 
2-1 +2 *t1|20 
6—1 +2 +1 
or 
x-1 y*2 z+1 
1 5 1 =0 
5 4 -1 
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Figure 103 


By expanding this determinant by the first row, we obtain 


(x - 1) -o*2 «nli 3 


5 1 
4 -1 


ia 
or 


—9x — 1) + 6y + 2) —-21(z- 20 


Removing brackets and dividing both sides of the equation by 
— 3 yields the final result 3x — 2y + 7z = 0. We can see that 
the plane passes through the origin. 

3. The intercept equation of a plane. 

If the plane intersects the three coordinate axes and does not 
pass through the origin, then it is convenient to write its equa- 
tion in intercept form. 

Given three points on the coordinate axes distinct from the 
origin, namely, A(a, 0, 0) with a » 0 on the x-axis, B(0, b, 0) 
with b #0 on the y-axis, and C(0, 0, c) with c #0 on the 
z-axis, find the equation of the plane passing through A, B, 
and C. 

We can take 


Ae ee 
"XL car 1 (3) 


as such an equation. In fact, (3) is a linear equation and, there- 
fore, defines a plane. By substituting the coordinates of A into 


a S ERRARE ERIS 


————— 
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(3) we ascertain that the plane does pass through this point: 


a 0,0 
ey ce 


Similarly, we can show that the plane also passes through B 
and C. 

Equation (3) is called the intercept equation of a plane since 
it defines the plane which cuts the x-intercept a, the y-intercept 
b, and the z-intercept c. 

The intercept form is very convenient and illustrates the posi- 
tion of a plane in space (Fig. 103). 

Any general equation of a plane Ax + By + Cz - D-0 
can be reduced to form in (3) if A, B, C, and D are all nonzero. 
We must transfer the constant term D to the right-hand side 
and then divide both sides by — D. The result is 

x y Z Zj 


ZD/A ” -D/B -D/C 


which is the intercept equation with a = —D/A, b = —D/B, 
and c= - D/C. 

Example 1. Reduce the equation of a plane 4x — 3y + 2z — 
12 = 0 to intercept form. 

©) H transforming the equation as above we get 2 - 
3y X = 
2.1 5 =i E ee m 1. © 

Example 2. Find the volume of the pyramid bounded by the 
plane defined in Example 1 and the coordinate planes. 

© We assume that the base of the pyramid is a triangle with 
the vertices O(0, 0, 0), A(3, 0, 0), and B(0, —4, 0) in the 
xy-plane and the altitude of the pyramid is the line segment 
OC with C(0, 0, 6). Using the formula for the volume of a 
pyramid and substituting the data yields 

1 3-4 


V=} Soh =3-- E x -6= 12. ® 
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6.3. DISTANCE BETWEEN A POINT AND A PLANE. 
ANGLE BETWEEN TWO PLANES 


1. Distance from a point to a plane. 
Given a plane a defined by the equation 


Ax + By + Cz+D=0 (1) 


and a point Mo(xo, Yo, zo) in space, find the distance d from 
Mo to a. 

This problem is similar to finding the distance between a 
point and a straight line, which we discussed in Sec. 4.6. There- 
fore we immediately write 


_ LAX» + Byo + Czo + DI 
VA +R +C 


We now formulate the following rule: in order to find the dis- 
tance from the point Mo to the plane a defined by the general 
equation (1), we must substitute the coordinates of Mo for the 
running coordinates x, y, and z on the left-hand side of (1) and 
divide the absolute value of the result by VA* 3 E +C. 

Example 1. Find the distance from the point Mo(1, 1, 1) and 
the plane defined by 2x + 2y - z - 3-0. 

O We have from formula (2) 


I2-1+2-1-—1+31_ 6 


d= =—=2 6 
Irr eye 


Example 2. Find the distance between two parallel planes de- 
fined by 2x + 2y — z + 3 = 0 and 2x + 2y -z -3 =0. 

O The easiest way is to choose a point Mo on one plane 
and then find the distance from Mo to the other plane. For in- 
stance, we choose the point Mo(l, 1, 1) on the second plane. 
According to Example 1, the distance from Mo to the first plane 
is 6. 0 

6. Angle between two planes. 

The angle between two planes is the angle between the nor- 
mal vectors to these planes. 

A normal vector to a plane can have either of two opposite 


d 


(2) 
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Figure 104 


directions and, therefore, the angle between two planes is mul- 
tivalued (Fig. 104); it may have two values, y and v — ¢. Since 
cos (x — c) = —cos y, we can find the cosine of the angle be- 
tween two planes from the formula where n; and nz are two 
normal vectors to the planes o and £ respectively. 

If the planes are defined by the general equations 


Aix+ By t Ciz * Dj 20 for a 
A2x + Boy + CozD;-20 for f 


then we can choose n, = (A1, Bj, Ci? and m = (A5, B5, C2) 
as the normal vectors. Then 


esas A142 + B1B; + CC 
VÀtJUBIASCIN AZ + B+ CG 


Corollary. For two planes to be perpendicular it is necessary 
and sufficient that the condition 


A,A2 + BBR +C =0 


is satisfied. 
In fact, if the planes « and f are perpendicular, then 
ni-n = 0. The converse is also true. 


6.4. HALF-SPACES 


We consider a plane o in space. The plane divides the entire 
space into two half-spaces and is their boundary. Every half- 
space is indicated by the parallel lines in Fig. 105. 
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Figure 105 


Suppose that the equation of the plane « is 
Ax + By - Cz+D=0 (1) 


We characterize the half-space defined by « in algebraic terms. 
One half-space is the set of points M(x, y, z) for which 


Ax + By * Cz D20 (2) 
and the other half-space is the set of points N(x, y, z) for which 
Ax + By+Cz+D<0 (3) 


a We first suppose that C z 0. Then equation (1) reduces 
to the form 
z=axt+byt+k (4) 


and (2) is equivalent to the inequality 

zpax+by+k (5) 
and (3) is equivalent to the inequality 

z<ax+by+k (6) 


Inequality (5) defines the half-space lying above the plane given 
by (4) (Fig. 106) and (6) defines the half-space lying below the 
plane. 

If C = 0, then A or B is nonzero; then reasoning in a similar 
way (but with respect to another coordinate, x or y) we can 
obtain the required result. m 
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Ma (xo,Yo 122) Z2 < Zo 


Figure 106 


Example 1. Given the plane 2x — 3y + 7z — 5 = 0 and two 
points M(4, —2, —2) and N(1, 3, 1), find whether the points 
lie on one side of the plane. 

© By substituting the coordinates of M and N into the left- 
hand side of the equation we have 2-4 — 3-(—2) + 7-(—2) - 
52 —5 and 2-1 — 3-3 + 7-1 — 5 = —S. Since the numbers 
have the same sign, we infer that M and N belong to the same 
half-space. 6 

Example 2. Given the plane a defined by the equation 
X t y — Az + 1 = 0 and two points P(l, 1, 1) and QQ, 2, 1), 
find whether the line segment PQ intersects the plane. 

O Substituting the coordinates of P and Q into the left-hand 
side of the equation yields 1 + 1—4 + 1 = —1and 1 - 2- 
4 + 1 = 1. Since the resulting numbers are different in sign, 
we infer that P and Q belong to different half-spaces with 
respect to the plane o. Thus, the line segment PQ intersects 
the plane. 6 


Exercises to Chapter 6 
6.1 
1. Describe the surfaces defined by the equations: 
(1) x = 0, (2) y = 0, (3) z = 0 (4) xy = 0, 
(5) xyz = 0, (6) x 22,00 xy - 5 =0, 


2 
& +7 =1 Ort 
a »b 
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2. Form the equation of the plane through the point P(— 1, 2, 3) and perpen- 
dicular to the radius-vector of the point. 

3. Find the equation of the plane through the point P(1, 3, —4) and perpen- 
dicular to the radius-vector of the point Q(— 1, 0, 2). 

4. Given three points M,(1, 7, —2), M2(4, —3, 2), and M3(3, 4, 2), find 
the equation of the plane through M, and perpendicular to the straight line 
M.Ms3. 

5. Find the equation of the plane through the point A(0, —2, —3) and 
parallel to the plane defined by x + Sy — 42 +2 = 0. 

6. Find the equation of the plane through the point A(l, 1, 1) and parallel 
to the xy-plane and through the point B(—1, 2, 7) and the x-axis. 

7. Find the values of p and q such that the following pairs of equations 
define the parallel planes 


Cd da. imd qui tcmpus eed 

a 

2x -4y -—qz+7=0, 2x —-3y * qz— 101-20 

8. Find whether the planes defined by px - y -z = Oandx+y+z=0 
coincide. 


6.2 


1. Find the equation of the plane passing through the origin and the points 
Mi(l, 2, 3) and M2(4, 5, 6). 

2. Find the equation of the plane 

(a) through two points Mi(— 1, 1, 4) and M2(6, 4, —3) and parallel to the 
x-axis. 

(b) through two points Mi(0, —1, 1) and M2(2, 1, 3) and perpendicular to 
the plane defined by 4x — 3y + 5z = 0. 

3. Find the points at which the plane defined by 7x — 5y + 2z — 14 = 0 
and the coordinate axes intersect. 

4. Find the intercept equation of the plane defined by 3x — Sy + 45z + 
15 = 0. 

5. Find the equation of the plane cutting the z-intercept c = —5 and perpen- 
dicular to the vector n = Q2, —7, 1). 

6. Find the equation of the plane through the point A( — 1, 1, 3) and cutting 
equal intercepts on the coordinate axes. 


6.3 


1. Find the distance from the origin to the plane 2x + 2y — z — 1 = 0. 

2. Find the distance between two parallel planes 6x + 3y + 2z = 5 and 
6x + 3y + 2z- 1. 

3. Find the equation of the planes parallel to the plane 6x + 3y — 2z + 
7 = 0 and a distance 5 away from it. 

4. Find the point on the x-axis equidistant from two planes 12x — 16y + 
15z + 1 = 0 and 2x + 2y- z- 1-0. 
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5. Given two faces of a cube lying in the parallel planes 
6x — 3y + 2z + 1 = 0 and 6x — 3y + 2z + 4 = 0, find the volume of the 
cube. 

6. Find the distance from the point P(1, 1, 1) to the plane passing through 
the points A(4, —1, —1), B(2, 0, —2), and C(3, —1, 2). 

7. Determine the altitude of the pyramid SABC dropped from the vertex 
S on the base ABC if S(O, 6, 4), AQ, 5, 3), B(-2, 11, —5), and C(I, —1, 4). 

8. The normal vector n = (2, —3, 5) to the plane o defined by the equation 
2x — 3y + 5z — 5 = 0 originates at a point on the plane. Find whether the 
terminus of n lies on the same side of the plane o with the origin of coordinates. 

9. Find the angle between two planes: 


(a4x—-Sy-3z-1-20 (b)3x-y+5z+2=0 
4x + Sy + 32 - 2-0 5x +3y—z+ 10=0 
10. Given two planes through the point M(— 5, 16, 12): one plane contains 


the x-axis and the other the y-axis, find the cosine of the angle between these 
planes. 


6.4 

1. Find whether the point M( —2, 1, 4) and the origin lie on the same side 
of the plane 3x - 7y + 22 - 1 2 Q. 

2. In each of the cases find whether the points M(2, —1, —1) and Nil, 2, 
—3) lie in the dihedral angle formed by the planes or in adjacent angles or 
in vertical angles 

(a)3x-—y+2z-3=0 (b)2x-y+5z-1=0 
x-2y-z44-0 3x«x-2y + 6z-1=0 


Chapter 7 


A STRAIGHT LINE IN SPACE 


7.1. EQUATIONS OF A LINE IN SPACE. EQUATIONS 
OF A STRAIGHT LINE 


1. A line as the intersection of two surfaces. 
A basic method of defining a line in spatial analytic geometry 
is its representation as the intersection of two surfaces. 

Suppose we have two surfaces defined by the equations 
F(x, y, z) = 0 and (x, y, z) = 0, respectively. Then, their inter- 
section consists of the points M(x, y, z) such that their coor- 
dinates satisfy both equations simultaneously. Naturally, we 
can adopt the following definitions. 

Definition 1. Suppose we have a system 


oe y, z= 0 
(x, y, 7) = 0 


The line L defined by this system is the set of all pdints of 
space whose coordinates satisfy the system. 

Definition 2. The equations of a line L in space are any two 
equations F(x, y, z) = 0 and ®(x, y, z) = 0 such that the line 
defined by the system of these equations coincides with L. 

2. Parametric representation of a line. 

Another way of defining a line is to represent it parametrical- 
ly. Suppose we have three arbitrary functions f(t), e(t), and 
y(t) defined on a set T (say, on an interval from a to b). Then 
the set of points M(x, y, z) whose coordinates are given by the 
formulas 


~ 


x = f(t), y = e(t), z = vt) (1) 


where the parameter ¢ (the argument) runs over the set 7, is 
a line defined parametrically and equations (1) are called the 
parametric equations of the line. 
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Figure 107 


Example. The line defined by the parametric equations 
x = Rcost, y = Rsint, z = t, where ¢ varies from 0 to 2, is 
a circular helix whose projection on the xy-plane is a circle cen- 
tered at the origin (Fig. 107). 

3. Parametric equations of a straight line. 

The position of a straight line in space can be given in differ- 
ent ways. For instance, a line can be determined by any two 
points on it, or by two planes which intersect along it, or by 
a point on it, or by two planes which intersect along it, or by 
a point on it and a vector parallel to it. Each has a different 
form for the equations describing the line. 

We assume any nonzero vector parallel to a straight line to 
be the direction vector of this line. 

Given a point Mi(xi, Yı, 21) and a nonzero vector a = (p, 
q, r) in a rectangular Cartesian coordinate system, find the 
equation of the straight line / passing through M; and whose 
direction vector is a. Obviously, a point M(x, y, z) lies on / 
if and only if the vector M M is collinear with a, i.e. when there 
is a number ¢ such that Mi M = ta. We write the last equation 
in coordinates: 


xX- X = ÍP, y — yı = tq, Z- Z =tr 
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Figure 108 


or 
x-xttpy-ycttíq,z-zeatr (2) 


Thus, relations (2) are the parametric equations of the 
straight line passing through the point Mi(i, Yı, 21) and whose 
direction vector is a = (p, q, r}. 

When the parameter ¢ varies from — to +œ, the point 
with the coordinates in (2) runs along the entire straight line. 

Example 1. Write the parametric equations of the straight 
line through the point Mi(— 1, 2, 5) and parallel to the vector 
a= (9, —2, 4). 

O Using formulas (2) we write the desired equations in the 
form 


x= -1+ 94, y=2-2t,7=5+4t. @ 


Example 2. Find the distance from the point P(l, 1, 1) to 
the straight line / defined by the parametric equations 


x--4At,y-2841,z2-26243t G) 


O In order to find the distance from the point P to the 
straight line / it is necessary to find a point Q(x, y, z) such 
that the vector PQ is perpendicular to /, ie. PQ L a, where a 
is a direction vector of the straight line (Fig. 108). We can take 
the vector a = (—4, 1, 3), whose coordinates equal the coeffi- 
cients of ¢ in equations (3), as the vector a. 

The coordinates of the vector PO arex-—l,y-—1,z-1. 
Using (3) we can write PO = (—4t — 1, 7 4 t, 5 + 31). 

The condition for the vectors to be perpendicular is that their 
scalar product is zero. Therefore, instead of PO L a we can 
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write 
(-1 - 40-(-4) + (7+. 9-14 (5 + 30320 


whence ¢ = —1. Thus, the value of the parameter ¢ correspond- 
ing to the point Q is —1. Consequently, the coordinates of Q 
arex = —4t24,y 28«4 127,27 2 6 + 3t = 3. We can now 
find the distance from the point P to the straight line / 


IPQI =V(4- 1} + (7 - 1? + 3 - 0? = V49 = 


4. Canonical equations of a straight line. 

The parametric equations of a straight line, i.e. equations (2), 
express the proportionality between the numbers x — x1, 
y — yn Z - zi (the coordinates of MiM ) and p, q, r (the coor- 
dinates of a). Thus, we can write them as 

AS XM y-—yr-z- (4) 


Equations (4) are the canonical (symmetric) equations of a 
straight line. 

We must clearly understand that (4) is a system of two 
equations 


XM VT Si 
p q 
X-X 4 c 
p r 


each of which is of first degree, that is the equation of a plane. 
Thus, the canonical equations define a straight line as a line 
along which two planes intersect. 

Strictly speaking, representing the equations of a straight line 
in form (4) is only meaningful when the three numbers p, q, 
r are all nonzero. Nevertheless, equations (4) can also be used 
when one or even two of the numbers are zero. Say, for p = 0 
we write 


x—X y-n z-Z (5) 


0 


"| 
“= 


12— 366 
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Let us show how this notation can be understood. We know 
that equations (4) express the collinearity of two vectors 
(x — X1, y — Yi, Zz — zi)? and (p, q, r). When p = O, collineari- 
ty means that 

y zy £: xd 


X= xX, = 7 


This is precisely what equations (5) mean. 
If p = 0 and q = 0, then we write (4) as 


0 0 r 
which we understand as a system of two equations 
X-2X5,y72M 


(explain why). 

Example 1. Find the canonical equations of the straight line 
through two points Mi(1, 0, —1) and M;(-2, 1, 2). 

O We take Mı M; = (—3, 1, 3) as the direction vector, then 
the canonical equations (4) assume the form 


x—1l y o zti 
-3 1|» 3 xs 


Example 2. Write the canonical equations of the straight line 
through the point (1, 0, 2) and parallel to the y-axis. 

© We take the unit vector j = <0, 1, 0) as the direction vec- 
tor of the line and write the canonical equations as 


em NA et 
0 = g 
Actually, these equations imply the system x = 1, z 2 2 (y 
is arbitrary). 


7.2. GENERAL EQUATIONS OF A STRAIGHT LINE 


1. Finding a vector perpendicular to two given vectors. 
This subsection is optional. We present a method for finding 
from two vectors in space n; and nz a third vector n perpendicu- 
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lar to each of the vectors. Here if n; and n; are not collinear, 
then n zx 0; otherwise n = 0. 

We introduce a rectangular Cartesian coordinate system in 
space and assume that the vectors n; and n; are specified by 
their coordinates: n; = (Ai, Bi, Ci), n2 = (A5, Bo, C»). 

Theorem. The vector n with coordinates 


B 1 Ci Ai Ci E B 1 
B; C2 A2 C2 A2 B; 
is perpendicular to each of the vectors nj and m; n = 0 if and 
only if nj and nz are collinear. 

Remark. Expression (1) for the coordinates of n can be easily 
remembered in the following way. We form the array 
Ay B 1 C 1 
A2B2 C 


(1) 


from the coordinates of n; and nz. Eliminating in turn the first, 
second, and third columns yields three square matrices 


Bi Gi\ (Ar C\ /Aı Bi 
B2 C; Aa C; EP B; 
whose determinants are the coordinates of the vector n; the sec- 
ond determinant should be multiplied by —1. 
o In order to ascertain that n and n; (i = 1, 2) are perpen- 


dicular, it is sufficient to verify that their scalar product is zero. 
We have 


Bi Ci A; Ci 

B; A B E 2j TÉ | 

= A1(B1C5 — B53C1) - Bi(A1C» - A2C) 
+ Ci(A1B; — A;Bi) 


A, Bi 


ma = Ar | A: Bi 


Collecting like terms on the right-hand side yields n;-n = 0. 
Similarly, we can verify that m-n = 0. 

If the vectors n; and nm are collinear, then their coordinates 
are proportional and then each determinant in (1) is zero. Hence 
it follows in this case that n is zero. 


pre 
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Conversely, let n = 0, ie. the three determinants in (1) are 
zero. We prove that in this case m and m are collinear. If 
nz = 0, then the proof is trivial: a zero vector is collinear with 
any vector. Let nz # 0, then at least one coordinate of nz, say 
A5, is nonzero. Since the second determinant in (1) is zero, it 
follows that its rows are collinear (as vectors in the plane). Be- 
cause the vector corresponding to the second row is nonzero 
(since A2 z 0), we have, by the theorem in Sec. 1.1, subsec. 4, 


A1-2À4, C=C 


where à is a scalar. Similarly, since the third determinant in 
(1) is zero, we have 


A = pÁ, Bi = pB 


where y is a scalar. It follows from the last two equations for 
A, that > = p (since Az # 0). Hence, 


A =), Bi =dB, Ci =r 


that is, n; is collinear with nz. = 

We denote the vector n, whose coordinates are given in (1), 
by m X m. 

Thus, we introduce a new operation of mutiplication for vec- 
tors in space. Whereas the scalar multiplication of two vectors 
yields a number, the new operation results in a vector and 
n, X m is the vector multiplication of nj and nz. 

Example 1. Show that 


ixj-2-kjxk-ikxi-j (2) 
where i, j, and k are the unit vectors. 
O Since i = (1, 0, 05, j = <0, 1, 0), and k = (0, 0, 1), the 
vector i x j has the coordinates 
0 0 1 0 1 0 
1 0 0 0’ 0 1 
ie. ix j= (0,0, 1) =k. 


The remaining two equations in (2) can be verified in a simi- 
lar way. @ 
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Example 2. Find ax b, with a= (l, 1, 1) and b= 
(1, -1, 1». 
O The coordinates of the vector a x b are 
1 1 1 1 1 1 
-1 1 1 1 -1 1 


, , 


ie a x b= (2, 0, 2). @ 

2. General equations of a straight line. 

Two nonparallel planes define a straight line along which 
they intersect. Consequently, the system of equations 


Axx + By - Ciz Di =0 Q3) 
Ax + By +O.z+ D,=0 


defines a straight line in space provided the vectors n; = 
(A1, By, Ci)? and m = (A5, Bj, C)? are noncollinear. 

Equations (3) are called the general equations of the straight 
line. 

We can transform the general equations to canonical ones. 
To do this it is necessary to choose the direction vector of the 
straight line and a point on that line. 

Let us show that we can take the vector n = n, X m» as direc- 
tion vector of the line defined by (3), ie. 


us Bi Ci A1 Ci 
E B; C; 


A, C 

We use a to denote one plane and £ to denote the other plane 
defined by (3). The vector n; = (A1, Bi, C1) is perpendicular 
to a and the vector m = (445, B2, C2) is perpendicular to £. 
Since n is perpendicular to n; and m (according to the theorem 
on p. 179), it is parallel to the planes a and 8, respectively, 
and consequently, to the straight line in which the planes inter- 
sect. In other words, n is the direction vector of the straight 
line defined by (3). Note that n # 0 since n; and m are not 
collinear by assumption. 

We now show how we can find the coordinates of the point 
through which the line defined by (3) passes. Since n # 0, at 


A Bi 
A, B 
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least one coordinate of n is nonzero. Let, for instance, 


A; Bi 


A B #0 


We write system (3) as 
Pa + Bıy = D,- Cız (4) 
A2x + Boy = D, - C2z 


and setting z equal to a number zi (say, zero), we find from 
(4) the values of x and y, viz., x = x1, y = yi. The three num- 
bers define the desired point. If we know the point Mi(xi, Y1, 
zi) on the straight line (3) and the direction vector n, then we 
can write the equation of that line 


Heu... . 2-0. 
Bi à| JA à| JAB 
B; C; A3 C A: B 


Example. Write the canonical equations of the straight line 


x+2y+3z-1=0 

pe m 6) 
O We first find the direction vector. We have 
Bi [o _ 2 3| i Ai Ci 1 d es ra 
B Q| |-1-1| "|4 G 1 -1 

A Bil _ {1 2|. 4 

A: Bi 1-1] 


thus, the direction vector is n = (1, 4, —3). Then we have to 
find a point on the line, that is, a solution of system (5). Setting 
z=0 yields the system 


x+2y=1 
x-y=-2 
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We solve it to find that x = —1, y = 1. Thus, (-1, 1, O) is a 
point on the straight line defined by (5). The canonical equa- 
tions of this line have the form 


xttl y-1 2 
1 4 aC na 


7.3. RELATIVE POSITION OF TWO STRAIGHT LINES 


Suppose two straight lines /; and b are defined in a rectangu- 
lar Cartesian coordinate system by canonical equations 


Xe AN a Ya oe 
Pı qı rı 
X732. VS 2 eS 
P2 q2 rr 


Two cases are possible here: 

(1) h and h lie in the same plane, and 

(2) h and b lie in different planes, that is, they are skew. 

Let us find when case 1 occurs. We consider three vectors 
a; = (pi, q1, ri? and az = (p2, q2, r2) which are the direction 
vectors of the straight lines /; and b, and Mi M, = (x — xi, 
ya — Yı, Z2 — zi)? (Fig. 109). For the lines /, and 5 to lie in the 
same plane, it is necessary and sufficient that the three vectors 
are coplanar. 

The condition that three vectors are coplanar (see Sec. 3.1) 
is that the third-order determinant composed of the coor- 


ag 
Mp 
a, 
lo M, 


Figure 109 Figure 110 
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dinates of the vectors is zero: 


X3—X1 Jy- yi 22—Zi 
p qı n -0 (1) 
P2 q2 Le) 


Thus, the straight lines |, and h lie in the same plane if and 
only if condition (1) is satisfied. 

Whence the condition that straight lines are skew: the 
straight lines are skew if and only if 


X3—X1 -y 22—4Zi 
p qi n #0 
pz q2 r 


If straight lines lie in the plane, then they intersect, or are 
parallel and do not coincide, or coincide. When the lines coin- 
cide, the three vectors M@,M2, a;, and a; must be collinear. 
When the lines are parallel, vector a; is collinear with az but 
they are not collinear with Mı M: (Fig. 110). If the lines inter- 
sect, then a; and a2 are not collinear. 

Example. Find the relative position of the straight lines 


Xo lp i4 Xe2. py-2 1-3 
1 2 IRA 3 1 2 
O We have 
A 2-1 2-0 3-2 12 4 
1 2 1 =l 2) 7) £9 
3 1 2 312 


thus, the lines are in the same plane. The vectors a; = (1, 2, 
1) and a; = (3,2, 1) are not collinear and hence the lines inter- 
sect. In order to find the intersection point, we write the equa- 
tions of the first straight line in parametric form: x = 1 + f, 
y =2t, z 2 2 + t, and substitute them into the equations of 
the second line. We have the following system of equations 


t-1 2t-2 t-1 
3 i 2 
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l2 


Figure 111 


Since the straight lines intersect, the system must be consistent. 
Solving it yields t = 1. Hence the coordinates of the point of 
intersection are x = 2, y= 2, z = 3. 0 

1. Angle between two straight lines. 

The angle o between two straight lines in space is the smallest 
of two adjacent angles formed by two straight lines drawn 
through an arbitrary point in space and parallel to the two given 
lines (Fig. 111). 

Let a1 = (pi, q1, r1? and az = (pz, q2, r2) be direction vec- 
tors of the given lines. The angle between these vectors either 
equals the angle y between the lines or is the supplement of 
yg. Therefore 


cos e = Puuk I IPiP + qı + nni 
tails? Pi + qi + rANp +g +r 


If the straight lines are perpendicular, then cos e = 0 (the con- 
verse is also true), then the condition that the straight lines are 
perpendicular has the form 


Pih + dido + rnn = 0 


If the straight lines are parallel, then a; and az are collinear 
(the converse is also true), then the condition that the lines are 
parallel is 
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7.4. RELATIVE POSITION OF A STRAIGHT LINE 
AND A PLANE 


The following cases are possible for a straight line / and a 
plane o in space: 

(1) a straight line / intersects a plane a, i.e. they have a point 
in common, 

(2) l is parallel to a but does not lie in o, in this case they 
have no points in common, 

(3) / lies in o. 

Let us find how to distinguish between these cases if the 
plane « is defined by the equation 


Ax + By+Cz+D=0 
and the straight line / by the canonical equations 


We consider two vectors n = (A4, B, C) and a = (p, q, r}, 
one of them being perpendicular to the plane « and the other 
to the straight line /. 

Obviously, the line / is parallel to « if and only if a is perpen- 
dicular to n, that is when the scalar products of these vectors 
are zero. Consequently, 


Ap +Bq+Cr=0 (1) 


is the necessary and sufficient condition for the straight line 
l and the plane « to be parallel. Now, if this condition is not 
fulfilled, ie. if 


Ap + Bq* Cr z0 


then | and a intersect. 

Suppose that the straight line / is parallel to the plane a, i.e. 
condition (1) is valid. 

The straight line lies in the plane a if and only if a point 
Mi(xi, Yı, Z1) on / lies in this plane. Consequently, the neces- 
sary and sufficient conditions that the straight line / lies in the 


| 
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plane « can be written as 
Ap + Bqg+Cr=0 
Axı + By; + Cc + D=0 
Example. Let-us consider the straight line 
4S yr 2 Eis 
5 3 -1 
and the three planes 


3x —4y+7z+4=0 (a) 
x—-—y+2z+3=0 (8) 
X—-yt2z72—5-0 (y) 


The straight line / intersects the plane « since 3-5 + (—4)-3 + 
7-(—1) # 0. At the same time, the line is parallel to a plane 
B since 1-5 + (—1):3 + 2:(— 1) = 0, but does not lie in 8 since 
the point Mi(3, —2, 0) on this line does not belong to £. Finally, 
I lies in the plane y since / is parallel to y and Mi belongs to y. 

1. Angle between a straight line and a plane. 

The angle 0 between a straight line and a plane is the angle 
formed by the line and its projection on the plane. The angle 
0 varies from 0 to 90°. 

Let a plane be defined by the equation 


Ax + By +Cz+D=0 
and a straight line by the canonical equations 


de. QE Pc: s 


p q r 
The vector n = (A, B, C) is perpendicular to the plane and 
the vector a = (p, q, ry is parallel to the straight line. 
We denote the angle between the vectors n and a by ¢. Figure 


112 shows that o = 90° + 6, hence sin = Icos ol. Whence we 
have 


sinü = |——-— 
Inl - la! 
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Figure 112 


or in coordinate form 
|Ap + Bq + Crl 
VA +B + CVpe ger 


We can find from (2) the angle between a straight line and 
a plane using the coefficients of their equations. 
Example. Find the angle between the straight line 


E 


sin@ = 


(2) 


15x -2y —4z-8=0 


and thé plane 6x + 15y — 10z + 31 =0. 

O We first find the direction vector of the line. To do this, 
according to Sec. 7.2, we should take the normal vectors of the 
planes defining the line, ie. m = (6, —2, —1) and m = (15, 
—2, —4), and then find the vector a = n; x n2. We have 


ác -2 -1 _]6 =1 6 -2 

~ M-2 -4[' 15 —4 15 -2 
or a = (6, 9, 18). To simplify our calculation still further, it 
is convenient to divide the coordinates of a by 3; we have 


a’ = (2, 3, 6). Using (2) we find the sine of the angle between 
the straight line and the plane: 


16-2 + 15-3 — 10-61 3 3 


sin ĝ = = c 
Je + 157 + (102 V2 33:68 197 B 


From trigonometric tables we find that 0 = 1°18’. @ 
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Exercises to Chapter 7 
7.1 


1. Find the equation of the straight line through the origin and the point 
(a, b, c). 

2. Draw a straight line through the point (3, —2, 7) and parallel to the 
sud F+ azt 

-3 5 

3. Given the equations of motion of a point x = 1 — 6f, y 2 2 + 3t, 
z = 5 — 2t, find its velocity v. 

4. Given the equations of motion of a point x = 3 — 24, y= —4 * f, 
z = 1 + 2t, find the distance covered by the point in the time interval from 
ti =O0toh = 5. " 

5. Draw a straight line through the point (—1, 2, 4) and perpendicular to 
the plane 3x + 2y - 7Zz c 5 - O. 

6. Find the projection of the point (1l, —1, 6) on the plane 
5x-y-22- 8-20. 

7. Write the equations of the sides of AABC with vertices A(l, 1, 1), 
B( —2, 3, 4), and C(4, —5, 5). 

8. Set up the equation of a plane passing through the point (— 1, 0, 1) and 
the straight line x = 1 + 55, y= -4+24,z= -1- f. 


i A 
straight line 


9. Write the equation of the projection of the straight line eal = 
ycd lec E 
6 -] 
10. Write the equations of the perpendicular from the point (2, 4, 1) onto 
the straight line = E = p ES cmd A 
5 -1 2 
11. Find a point symmetric to the point (6, 5, — 4) with respect to the straight 
x2 y-3 z+5 


2 
- on the xy-plane. 


12. Find the distance between two straight lines 
x y*3.:-] and pay »xlo xc 
5 -2 1 5 -2 1 
13. Given a cube with side equal to 1, find the distance from a vertex to 
the diagonal not passing through it. 
14. Verify whether a plane perpendicular to the diagonal of a cube and pass- 
ing through its midpoint intersects the cube along a regular hexagon. 


72 
1. Reduce the equations of a straight line 
Xty-z-6-0 
Pul lly + 2z + 27=0 
to canonical form. 
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2. Prove that the straight lines 
= Xty-z-0 
x*2 y 1 Z ànd DER 
x-y-—5z-8=0 


are parallel. 
3. Describe relative positions of the straight lines: 


(1) (5x+7=0 (2) (3x -1=0 (3) (Aix + By + Ciz - 0 
3y -2=0 2y + 5z=0 Aix + By + Qz = 0 


4. Form the equations of the projection of the straight line 


x~y-z=0 
2x+3y+z2-5=0 
on the yz-plane. 


5. Form the equations of the projection of the straight line 


x—-4y+2z-5=0 
3x+y-z+2=0 


on the plane 2x + 3y + z-6=0. 


7.3 
1. Prove that the straight lines ~ : VEERE 


z 
2 -5 
y-276-5tz- —6 + t lie in the same plane and find the equation of the 
plane. Find the point at which the lines intersect. 

2. Given the straight lines 


and x = —3 + 71, 


find a such that the lines intersect. 
3. Find whether the straight lines 


3x + 5y c 5z—- 3-0 x-2 y-6 z+7 
x+ S5y+ 14=0 1 3 -4 


intersect. 
4. Find the angle between the straight lines 


roy 95 ED e P Zt 
3 6 2 2 9 6 
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1. Given the straight line through the points ‘x;, yi, 0) and (x2, 0, zz), find 
the point at which this line and the yz-plar tersect. 
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2. Find the value of A for which the plane Ax — 3y + 7z — 2 = Ois parallel 
P F -4 
to the straight line 7 l =? = ^n 
3. Find the values of A and B such that the plane Ax + By + z- 9-0 
is perpendicular to the straight line i =2 H = x : 
4. Find the point at which the straight line eee dict 2 m na l 
plane 3x - 5y - 10z - 6 = 0. 2 5 -3 
5. Find the point at which the straight line 


ae 


and the 


5x —-3y + 2z-15=0 


and the plane 10x — lly + 5z — 36 = 0 intersect. 
6. Find the sines of the angles formed by the straight line 


4x —6y + 32 + 18=0 
x-z+3=0 


and the coordinate axes. 


Chapter 8 


QUADRIC SURFACES 


Any surface whose equation in rectilinear Cartesian coor- 
dinates has the form 


(x, y», z) = 0 
where ®(x, y, z) is the second-degree polynomial in x, y, z, i.e. 


(x, y, z) = Ax? + By? + Cz? + Dxy + Exz + Fyz + 
+ Gx+ Ay+Kz+L 


is called a quadric surface. 
We shall not consider cylindrical surfaces, i.e. those defined 
in some coordinates by an equation of the form 


$(x y) = 0 


(its left-hand side is independent of z). Then, as can be shown, 
the equation of a surface in convenient coordinates reduces to 
one of the two forms: 


Ax? + By) + Cz? + L = 0 with A, B, C, L nonzero (I) 
Ax? + By? + Kz = 0 with A, B, K nonzero aD 
Ellipsoids and hyperboloids are the most interesting exam- 


ples of surfaces defined by (I), and the graphs of (II) are 
paraboloids. 


8.1. THE ELLIPSOID 


Definition. The ellipsoid is a surface defined by the equation 
in convenient rectangular coordinates: 
èl yg 
Staatl a0, b0 c0 1 
zc b e 0 
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Figure 113 


In order to visualize the shape of an ellipsoid and to sketch 
it in a plane, we use the method of parallel sections, that is 
examine its intersections with various planes. 

Let the cutting planes be parallel to the xy-plane, each of 
them being defined by the equation z = h (= const). The line 
of intersection of the plane and an ellipsoid is determined by 
the system of equations 


x? y? zZ xi 2 k? 


We rewrite it as 
i + Ee ER =1 
d-m/c) PA- RA 
z=h (2) 


If |hl <c, ie. —c < h <c, then 1 — h?/c’ is positive and the 
section is an ellipse (Fig. 113) with the semiaxes 


2 na | he 
aq, = 1 c bi - b fl oe 


The semiaxes have maximal values when A = 0, namely a; = a 
and b; = b. 


If h increases from 0 to c or decreases from 0 to — c, then 
a, and b, decrease from a to 0 and from b to 0, respectively. 


13—366 
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When A = c or h = —c, the sections of the ellipsoid are the 
points (0, 0, c) and (0, 0, —c). When |A| > c, the first equation 
in system (2) has no solution since its left-hand side is nonposi- 
tive; consequently, there are no points belonging to the ellipsoid 
outside of the strips between the planes z = c and z= —c. 

Similar results are obtained when the ellipsoid and the planes 
parallel to the xz- and yz-axes intersect. 

Note once more that the intersection of the ellipsoid defined 
by (1) and the xy-plane is the ellipse 


y=0 


and this ellipse has the longest semiaxes (a and 5) out of all 
its sections by the planes parallel to the xy-plane. Similarly, the 
intersection of the ellipsoid and the xz-plane is the ellipse 

re 

“+75 =1 

¢ c 


x=0 
and it has the longest semiaxes (a and c) out of all its sections 
by the planes parallel to xz-plane. Finally, the intersection of 
the ellipsoid and the yz-plane is the ellipse 
2 2 


J Zoa 
po xl 
x=0 


It has the longest semiaxes (b and c) out of all its sections by 
the planes parallel to the yz-axis. 

The quantities a, b, and c are the semiaxes of the ellipsoid 
defined by (1). If a # b, b zc, c z a, the ellipsoid is called 
tri-axial or symmetric. 

When two of the semiaxes are equal, the ellipsoid is a surface 
of revolution. For instance, if a — b, the equation of the ellip- 
soid assumes the form 

2 2 
VELAT 
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Figure 114 


and its section by the plane z = A, with lhl < c, is a circle 
with center on the z-axis, and the ellipsoid is a surface of revo- 
lution whose axis is the z-axis. Such an ellipsoid is called an 
ellipsoid of revolution (or spheroid). 

Finally, when the three semiaxes of an ellipsoid are equal, 
the equation of the ellipsoid assumes the form 

2 2 
5 + 3 + = =1 


ie. it is a sphere with the equation 


x»-y.2g224 
82. THE HYPERBOLOID 


1. The hyperboloid of one sheet. 
Definition. The hyperboloid of one sheet is the surface de- 
fined by the following equation in convenient coordinates: 
2 y g 
—.-S=la>0,b>0,c>0 (1) 
Ce e 


13* 
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By cutting the hyperboloid defined by (1) by planes parallel 
to the coordinate planes, we can find its shape in a similar man- 
ner as we did in the case of an ellipsoid. 

The section of the hyperboloid in (1) by the plane z = h is 
a line defined by the equation 


ie. is an ellipse (Fig. 114) centered at the point (0, 0, ^) and 


whose semiaxes are a; = av 1 + A?/c? and bj = bv 1 + h2/c?. 
When A = 0, the semiaxes have minimal values a; = a and 
bi = b; as | hl increases, i.e. as the cutting plane moves away 
from the xy-plane, the values of the semiaxes increase in- 
definitely. 

The intersection of the surface in (1) and the xz-plane is the 
hyperboloid defined by the equations 


y=0 


The section of the surface by the yz-axis is the hyperbola with 
the equations 


Vm E 
pm č 
x=0 


It can be shown that any section of the surface in (1) by planes 
passing through the z-axis is a hyperbola. 

The hyperboloid of one sheet given by (1) opens indefinitely 
along the z-axis as |z! grows; the section by the plane z = 0 
is the “neck”. 

When a = b, equation (1) becomes 


| 
| 
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Figure 115 


Sections of this surface by the planes z = h are circles with 
centers on the z-axis. Consequently, in this case the hyperboloid 
is a surface of revolution about the z-axis. Such a surface is 
called a hyperboloid of revolution of one sheet. 

2. Hyperboloid of two sheets. 

Definition. The hyperboloid of two sheets is the surface de- 
fined by the following equation in rectangular coordinates: 

2 2 
s+% -i a>0, b>0, c>0 (2) 

This equation differs from (1) in the sign of the right-hand side. 
In order to find the shape of the surface in (2), we consider 
its sections by the planes z = h. These sections are lines lying 
in the planes z = h and defined by the equations 


x 2 n 
sg a] 
z=h (3) 


When lAl < c, the right-hand side of (3) is negative; conse- 
quently, the section is “empty”. Thus, there are no points be- 
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longing to the hyperboloid in (3) in the strip between the planes 


z= cand z = -c. When lhl = c, ie. when z = cor z = -c, 
the only solution of (3) is x = 0, y = 0. Hence it follows that 
the sections by the planes z = c and z = —c are the points (0, 


0, c) and (0, 0, —c) respectively. When lh! >c, equation (3) 
defines an ellipse with the semiaxes a1 = aVh?/c? — 1 and 
bı = bVh?/c* — 1 in the plane z = h. As lhl increases from 
c to infinity, a, and b, increase from zero to infinity. 

Thus, the surface (2) consists of two sheets, one lying in the 
subspace z > c and the other in the subspace z < -c (Fig. 115). 

The sections of the surface by the xz- and yz-planes are the 
hyperbolas 


2 2 2 
5-5. 5-522 
a c and b c 


It can be shown that any section by a plane passing through 
the z-axis is a hyperbola. 

For a = b (2) is a surface of revolution about the z-axis, 
which is called a Ayperboloid of revolution of two sheets. 


8.3. THE PARABOLOID 


1. The elliptic paraboloid. 
Definition. The elliptic paraboloid is the surface defined in 
rectangular coordinates by the equation 


2 
Cane 


zie d OPES U W 


This surface and the plane z = A can only intersect when 
h > 0. For A = 0 the section is a point, the origin. For h > 0 
the section is the ellipse defined by 
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Figure 116 


with the semiaxes a, = aVh and bı = bVh. As h increases from 
zero to infinity, a» and bı also increase from zero to infinity. 
Let us consider the section of the surface (1) by the xz-plane. 


This section is defined by the equations y = 0 and 23 x’ =z, 
a 


which is a parabola with the z-axis and lying in the plane y = 0. 
Similarly, the section by the yz-plane is the parabola x = 0, 
1 
b 
be shown that any section by a plane passing through the z-axis 

is a parabola with the z-axis. 
It is now clear why the surface (1) is called the elliptic 
paraboloid (Fig. 116). For a = b equation (1) assumes the form 


x! 4 y! 2 az Q 


In this case the sections by the planes z = A, where h > 0, are 
circles with centers on the z-plane. This means that (2) is a sur- 
face of revolution with the z-axis and is called an elliptic 
paraboloid of revolution. 
2. The hyperbolic paraboloid. 
Definition. The Ayperbolic paraboloid is the surface defined 
in rectangular coordinates by the equation 
2 
X3 -na»0b»0 (3) 


y? = z with the z-axis and lying in the plane x = 0. It can 
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The section by the plane y = 0 is the parabola 


y=0,2= 4x (4) 


with the vertex at the origin and the z-axis as the symmetry 
axis. Since here 1/a? > 0, the parabola lies in the half-plane 
y =0, z>0, ie. the parabola opens upward (with respect to 
the z-axis). 

The section by the plane x = 0 is a parabola with the 
equations 


x=0,z=—- y (5) 


ie. its vertex is at the origin and the parabola is symmetric 
about the z-axis; it opens downward since here —1/b? < 0. 

In order to visualize the shape of the surface in (3), clearly, 
we consider the section by the plane x = h, where iis a number. 
This is the parabola 


which results from the parabola in (5) by displacing it upward 
(i.e. in the positive direction of the z-axis) by h?/a?. The vertex 
of the new parabola has the coordinates 


2 
x=h y=0,2=4, 
a 


ie. it lies on the parabola in (4). 

Thus, in order to obtain the surface (3) we should use the 
parabolas (4) and (5), which lie in perpendicular planes and 
have their vertices and the axes of symmetry in common, one 
parabola opening upward and the other opening downward. 
Then one of them should be translated in space so that its ver- 
tex moves along the other parabola. The resulting surface is 
saddle-shaped (Fig. 117). i 

Let us consider sections of this surface by the planes z = A. 
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Their equations 


show that the section by the plane z = A for h # Ois a hyperbo- 
la. If A > 0, then the transverse axis of the parabola is parallel 
to the x-axis (the straight line z = h, y = 0), while for h < 0 
the transverse axis is parallel to the y-axis (the straight line 
z= h, x = 0). When h = 0, the section is defined by 


à oy 
Z = S — -L = 
a b 
i.e. it degenerates into a pair of straight lines ~ - B = 0 and 


F + A = 0 lying in the xy-plane. 


Part Two 


LINEAR ALGEBRA 


The first part of this book dealt with the elements of analytic 
geometry in the plane and in space on the basis of the ideas 
of coordinates and vectors. Let us remind that we began 
manipulating vectors after introducing two operations, namely, 
vector addition and the multiplication of a vector by a scalar, 
which follow the laws of arithmetic. 

Mathematics and its applications often involve objects which 
cannot be presented geometrically but admit operations with 
them like those indicated above for vectors. Such objects are 
also called vectors, but now the term has a wider sense. As 
in the case of ordinary vectors, these objects can be defined 
by sets of numbers—coordinates—with the difference that 
there may be any number of coordinates (not necessarily two 
or three). The study of such objects is called the theory of vec- 
tor spaces or linear algebra. 

We begin the second part of the book with a chapter on sys- 
tems of linear algebraic equations. The chapter is optional, but 
is necessary for presentation and is practically important. 


Chapter 9 


SYSTEMS OF LINEAR EQUATIONS 


This chapter considers systems of first-degree algebraic equa- 
tions. Special cases of such systems were discussed in Chapter 
3 when studying systems of n linear equations in n unknowns. 
In what follows the number of equations and the number of 
unknowns are not necessarily equal; both numbers are ar- 
bitrary. 

Algebraic first-degree equations are called /inear since a first- 
degree equation in two unknowns defines a straight line in the 
plane. 


9.1. ELEMENTARY TRANSFORMATIONS OF A SYSTEM 
OF LINEAR EQUATIONS 


Let us revise the notation introduced in Chapter 3. We 
denote unknowns by xi, x, ..., the coefficients of the 
unknowns in the ith equation of the system by ai, di», ... , 
and the constant term in the ith equation by b;. In this notation 
the general form of the system of m linear equations in n 
unknowns (an m X n system) is 


1X1 + diaXa + ... + QinXn = bı (Ist equation) 
@21X1 + G22X2 + ... + dànX4 = b2 (2nd equation) () 
GmiXi + Am2X2 + ... + AmnXn = Om (mth equation) 


Systems can contain equations of the form 
0-x + O-X%2 +... + Ox = b 


Obviously, for b = 0 this equation is satisfied by any set of 
unknowns, while for b # 0 no set of unknowns satisfies it. 
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The solution of system (1) is a set of n numbers a, 
02, ... , Œn Which when substituted for the unknowns in each 
equation of the system (a) for x1, a2 for x2, and so on) trans- 
form all the equations into valid numerical equalities. 

A system is said to be consistent if it has at least one solution 
and inconsistent if it has no solution. 

We later show that three cases are possible: 

(1) the system is inconsistent, 

(2) the system has a single solution, and 

(3) the system has infinitely many solutions. 

It is impossible to have a finite number of solutions greater 
than one. 

To analyze a system of equations means to establish whether 
it is consistent. If the system is consistent and has a single solu- 
tion, we should determine the solution; when the number of 
solutions is infinite, we should find the set of solutions using 
the most efficient method. 

A method for solving systems of linear equations, which is 
both sufficient and suitable for drawing theoretical conclu- 
sions, by a systematic process of elimination of unknowns is 
called Gaussian elimination or the Gauss algorithm. 

When solving a system of linear equations, we shall always 
start with a system written in the form in (1), namely, the order 
of terms in the equations of the initial system is preset: the 
first term is that with the unknown xı, the second is with x2, 
and so on. 

This order can be violated in the process of transforming the 
system, since the following operations are possible: 

(a) interchanging two terms in all the equations. For instance, 
interchanging the terms with x? and x4 in the system 


e eee: 


yields 3x1 + 6x2— X3 t x4 = 3 


5x1 — X4 + 4x3 — 7x2. = 2 
3x1tx4— xt 6x2 = 3 


Thus, a transformation of type in (a) is an identity transfor- 
mation, 
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(b) eliminating an equation of the form 
Ox + O-% +... Ox = (2) 


from the system, 

(c) adding a constant multiple of one equation to another 
equation. 

Definition. The operations (a), (b) and (c) are called elemen- 
tary operations. 

The following proposition is true. 

A system obtained from an initial system by finitely many 
elementary operations is said to be equivalent to it. 

This is obvious for (a) and (b), while (c) requires additional 
arguments. 

Suppose we have the system 


(3) 


djjXi + à, = bi 
Q21X1 + a22%2 = D 


We transform the system as follows: add both sides of the sec- 
ond equation multiplied by a number c to the first equation. 
The result is the new system 
(à11 + Ca21)X1 + (a2 + Car2)%. = By + chr (4) 
Qux + G22X%2 = br 


Every solution of the initial system (3) satisfies the new sys- 
tem (4). On the other hand, we can obtain (3) from (4) using 
an operation similar to (c). To do this we add the second equa- 
tion in (4) multiplied by —c to the first equation. Hence it fol- 
lows that every solution of the new system satisfies the initial 
system. Thus, the systems are equivalent. 


9.2. GAUSSIAN ELIMINATION 


1. Reducing a system to the echelon form. 

The essence of Gaussian elimination is to use elementary 
operations to reduce a system of equations to a form showing 
all its solutions. Let us assume that if the algorithm involves 
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the equation 
O-x1 + 0-2 +... + Ox, = 0 (1) 


then the equation is discarded, that is, a type (b) elementary 
operation is applied. 
Let us consider the system of linear equations 


a11Xı + di2X2 + ... + AinXn = bı 
G21X1 + 22% + ... + ArnXn = b2 (2) 
Ami X1 + Am2X2 + ... + AmnXn = Dm 


Two cases are possible here. 

Case 1. The system involves an equation in which all the 
coefficients of the unknowns are zero, while the constant term 
is nonzero: 


Oxi + 0-x2 +... +O, b (3) 


where b is nonzero. No set of unknowns can satisfy (3) and 
therefore a system containing such an equation is inconsistent. 

Case 2. The system has no equations like (3) with b # 0. In 
other words, each equation has at least one nonzero coefficient. 
Let us dwell on this case. Suppose that ai, # 0. Then we can 
eliminate x1 from all the equations beginning from the second 
one by adding the first equation multiplied by —a21/a1: to the 
second equation, the first equation multiplied by —431/a1: to 
the third equation and so on. The resulting system is 


QiiXi + di2X2 + ... + AinXn = D1 


GjaXo o... + AnXn = bj 


The first step is completed. The boxed equations of the system, 
which remain to be transformed, are the residual part. 

Let us apply step 1 to the residual part. As a result we either 
find an equation of type (3) (where b # 0) thereby establishing 
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that the system is inconsistent, or assuming az # 0 perform 


the next step, ie. arrive at the system 


Q11X1 + d12X2 + d13X3 + ... + AinXn = bi 
d2:,X2  23X3 + ... + Ginkn = bi 


asus Q3nXn = by 


nos + GmnXn = bm 


The second step is completed. The residual part contains one 
equation less. 

Continuing the process we arrive at one of two cases. We 
either obtain a system whose residual part contains an equation 
of the form 0-x1 + ... + 0-x, = b, with b #0, the system 
thus having no solution (being inconsistent) and there is no rea- 
son to transform it any further. Or, since the number of steps 
cannot exceed n (the number of unknowns), we finally obtain 
a system having no residual part, that is, to a system of the form 


biixi + Dbiaxa + ... + Dir, + ... + Dinkn = C1 
D23Xa + ... + BarXc + ... + DanXn = C2 


where the coefficients b11, b22, ... , brr are nonzero. The num- 
ber of equations may be different to system (2) (7 equations 
instead of m initial ones) because equations of form (1) are 
discarded. 

We shall call the system in (4) to be in the echelon form due 
to the appearance of the system. Just look at system (4), the 
broken line beneath the first terms is like a staircase. 

Now, let us see what we should do if the assumptions 
431 # 0, az» Æ 0, and so on, required when obtaining (4) are 
not fulfilled. 

At each step the coefficient of the first unknown in the first 
equation is called the pivotal coefficient. In system (2) the pivo- 
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tal coefficient is a11. If a); = 0, then one of the coefficients 
in the first equation is nonzero, say a12. Then we interchange 
terms with x; and x» in all the equations (a type (a) elementary 
operation) to make the pivotal coefficient nonzero. Then we 
eliminate x2, and not xı, from all the equations of the system. 
Similarly we may interchange terms at each step of the al- 
gorithm to ensure that the pivotal coefficient is nonzero. The 
resulting system is not necessarily of form (4); it may differ 
in the subscripts of the unknowns, but it will still be in the 
echelon form. 

Let us summarize. 

Theorem. Jf the elementary operations do not result in the 
equation of the form 


Oxi + Om +... + O-x%, =D 

with b nonzero, then the system reduces either to form (4) or 
to a form differing from (4) in the subscripts of the unknowns. 
Example. Reduce (if possible) the following system of four 
equations in five unknowns to the echelon form: 

—X| + 3x2 + 3x3 + 2x4 + $x5 = 2 

— 3x1 + 5x» + 2x3 + 3x4 + 4x5 = 2 
—3x, + x»- 5x — xs; = -2 

— 5x1 + 7x2 + x + l6x4 + x; = 10 


(5) 


O The pivotal coefficient is nonzero (it equals —1), thus, 
we can eliminate x; from all the equations of the system starting 
with the second, namely, we subtract the first equation multi- 
plied by —3, —3, —5 from the second, third, and fourth equa- 
tions, respectively. The resulting system is 


Il 
N 


—X1 + 3x + 3x3 + 2x4 + 5x5 


= 4x2 = 7x3 = 3X4 zm 11xs = 
— 8x — 14x3 — 614 — 22x5 = 
— 8x; — 14x3 + 6x4 — 24x5 = 


I 
l 
eo 


The pivotal coefficient in the boxed part of the system is non- 
zero (it is —4) and we can eliminate x. from the second and 
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third equations. We have 


-xi + 3x2 + 3x3 + 2X4 + 5xs=2 
— 4x. — 7x3 — 3x4 — llxs = —4 


0-x3 + 12x, — 2x5 = 8 


(the number of equations is one less than we started with since 
we discarded the equation 0 = 0). The pivotal coefficient in the 
residual part is zero, while other coefficients, say of xs, are non- 
zero. Interchanging the terms with x3 and xs yields a system 
in the echelon form 


—X1 + 3x2 + Sxs 2x4 + 3x3 22 
— 4x2 — lxs — 3x4 — 7x3 = —4 (6) 
— 2x5 + 12x4 8 


which is equivalent to the initial system (5). @ 

2. Solving system in the echelon form. 

We now consider the staired system (4). Two cases are possi- 
ble here: r= n and r <n. 

Case 1. r = n. Then the system has the form 


bixi + Dbi2xa +... + DinXn = C1 
baaxa  ... + DanXn = C2 

(7) 
DanXn = Cn 


and is called triangular. Noting that ban z 0 we find from the 
last equation in (7) the unique value of x,, then substituting 
it into the preceding equation and noting that ba-1,n-1 #0 
we find a unique value for x,-1, then substituting the values 
of x, and x,-1 we found into the preceding equation we find 
a unique value of x,-2, and so on. 

Thus, a system in the echelon form in which the number r 
of equations equals the number n of unknowns has a unique 
Solution. 

Case 2. r < n. We transfer the terms with Xr+1, Xr+2, -. 


14— 366 
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to the right-hand side of all the equations in (4) to obtain 


buxi + bx) +... + bDirXr = C1 — DyuraiXrsa — s. — DinXn 
bx +... + Dx = 2 - bz,r+1Xr+1 — 1. — DanXn (8) 
brXr = Cr - b, rs 1Xr«1 — — DmXn 


Substituting arbitrary numbers for xr. 1, Xr«2, ... , Xn into (8) 
yields a system of triangular form for finding the remaining 
unknowns xi, X2, ..., X. Solving this new system gives a 
unique set of values of x1, X2, ... , X. Hence we infer that 
for r « n, system (4) has infinitely many solutions as described 
by (8) (given arbitrary x«i, Xc«2, ... , Xn, we can find Ss. 
X2, ..., X; from (8). 

Unknowns whose values can be arbitrarily chosen are said 
to be free; Xr+1, Xr«2, ... , Xn are free unknowns for system (4). 

Thus, a system in the echelon form of r equations in n 
unknowns with r smaller than n has infinitely many solutions; 
here n — r unknowns are free. 

Remark. If when reducing the initial system to the echelon 
form we interchange the unknowns (type (b) elementary opera- 
tions), then any n — r unknowns may be free in the resulting 
system and not necessarily xr. 1, Xra2, ... , Xn. 

3. Analyzing a system of linear equations. 

Up till now our task was to describe all the solutions of a 
system of linear equations. In some cases, however, it is suffi- 
cient to know how many solutions the system has (no solution, 
or a single solution, or infinitely many solutions) and we say 
that we analyze the system. If the system has infinitely many 
solutions, we should also find the free unknowns (the values 
of the remaining unknowns are uniquely determined once we 
know the values of free unknowns). 

Gaussian elimination is sufficient to analyze a system of 
linear equations. As a result we either ascertain that the system 
has no solution (if an equation of the form 0-x1 + Ox + 

. + 0x, = b, where b # 0, appear) or reduce the system to 
the echelon form. Suppose we have the second case and r and 
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n are the number of equations and the number of unknowns 
in the system in the echelon form, respectively. If r = n, then 
the initial system has a single solution. If r < n, then the system 
has infinitely many solutions, and we can consider the 
unknowns undefined by the first terms of the equations in the 
system having the echelon form as free unknowns. 

Example 1. Analyze and solve system (5). 

O The system has been already analyzed (reduced to form 
(6). It has infinitely many solutions; x4 and x3 are free 
unknowns. " 

To solve system (5) means to find all its solutions explicitly. 
We transfer terms with free unknowns x4 and x; to the right- 
hand side of each equation in (6), i.e. we rewrite the system as 


-x +t 3x24 5xs—- 2- 2x4— 3x 
—4x,— llxs = —4 + 3x4 + 7x3 (9) 
— 2x5= 8- 12x 


Given x4 and x3, we can find xs, x), and xı. 

Thus, the problem is solved. However, we can continue the 
solution and express xs, X2, x in terms of x4 and x3. We have 
from the last equation in (9) xs = —4 + 6x4. Substituting this 
value into the second equation in (9) yields 


— 4x, — 11(—4 + 6x4) = -4 + 3x4 7x3 


Sige ERG 
whence x2 = 12 G4 7 
Finally, substituting the expressions for xs and x; into the 


first equation yields 


=e IC -Fu - 7+) + 5(-4 + 6x4) 
= 2 — 2x4 — 3x3 
whence x; = 14 — D xa -2 n, 
Thus, explicit expressions for xs, x2, and xı in terms of x4 
and x3 are xs = —4 + 6x4, x2 = 12 -62 2. and 


4 4 


14* 
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x= 14- 2 X4 — 7 x3. The system has infinitely many solu- 
tions. For example, putting x, = 0 and x3 = 0 yields a particu- 
lar solution xı = 14, x = 12, x3 = 0, x4 = 0, and xs = —4. @ 
Example 2. Solve the system 
2x1— Xt X3— X422 
2x1— x — 3x4= 1 
3x — XX%+ X4-8 
2x1 + 2x, — 2x3 + 5x4 = Il 


(10) 


‘© The pivotal coefficient is nonzero, nevertheless it is con- 
venient to begin by interchanging x; and x2 (in each equation). 
The result is 

—X2+2x,+ X3— X422 
—X2 + 2x1 — 3x4 = 1 

3x, — X%3+ x =8 
2x2 + 2x) — 2x3 + 5x4 = 1 


We have further 


—-%+2x%1+ x3- 4=2 -x2 + xyt2X3- my =2 
— x% + 0x 22. = -1 
—x3+3x+ x =8 
0-x3 + 6x1 + 324. = 15 


-Xx - X% + 2x, -— x22 
— X3 + Ox, — 2x, = -1 —X,+X%3+ 2x, - My = 2 
— x3 + 0x 2:4. = -1 


3x, + 3x, = 9 3x, + 3x4, = 9 
6x1 + 3x4 = 15 — 3x, = -3 


The last system has a triangular form, by proceeding from 
the last equation to the first, we obtain a single solution: 
Xx4271,x122,x21,x»220.0 
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4. Practical remark. 

The process of elimination can be simplified by considering 
the matrices of the equations of a system. We take system (10) 
as an example. We write the coefficients of the unknowns and 
the constant terms in matrix form. The result is the augmented 
matrix of system (10): 


| ll -1 2 
2 =l 0 -3 1 
3 0 -1 1 8 
2 2 =2 5.1 


We assume the following notation: we write the corresponding 
unknown under the column of its coefficients. Thus, the matrix 
of system (10) has the form 


2. =) 1 =) 2 
2 =1 0 -3 1 
8 
1 


xi X2 X3 X4 


Elementary operations for matrices céfrespond to elemen- 
tary operations for equations, namely, 

(a) an interchange of two columns corresponds to inter- 
change of two unknowns, 

(b) discarding a row consisting of zeros corresponds to dis- 
carding the equation 0-x, + ... + 0-x, = 0, 

(c) addition of one row multiplied by a constant k to another 
row corresponds to the addition of one equation of the system 
multiplied by a constant k to another equation. 

Now we write the solution of system (10): 


2 -1 ] -T 2 -1 2 I =l 2 
2 =l 0 =3 | -1 2 0 -3 1 
3 0 -2 5 8 03 -2 5 8 

1 222 —29 3531 


x1 X X3 X4 X2 Xı X3 X4 
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2g: =i a1 wd x». 
DS. 3-51 ü =i 0 =2 =] 

ibi 03-1 i 3 06-13 1 
06 0 3 15 0 06 3 15 
X2 xi X3 Xa X X3 Xl X4 
S E. E E NE. 
o -1 0 -2 -1)_ 0 ci o =e 
aao 0 3 .3 9 0 3 3 9 
0 06 3 15 Ü 960 -3 -3 
X2 X3 Xı Xa X X3 Xı X4 


where the ~ sign indicates that an elementary operation is per- 
formed on the matrix. 
The last matrix corresponds to the triangular system 


—X%.+x%3+2x,- X422 


-%3 — 2x4 = -1 
3x1 + 3x4 = 9 
a — 3x, = -3 


from which we find the unique solution: xa = 1, xi = 2, x = 1, 
x = 0. 
Example. Solve the system 
2X1 — X + 3x3 — 2x4 + 4x5 = -1 
4x, — 2x2. + 5x3 + X4 + 7x5 = 2 
2X1 — X^ X% + 8x4 + 2x5 = 1 


O We have 

2 -1 3 -2 4 -1 

4 -2 5 17 2| ~ 
2 -1 1 8 2 1 0 0 -2 10 -2 2 


X1 X2 X3 X4 Xs X1. X2. X3 X4 OS 


et 
l 
o = 
I 
æ US 
i 
aA N 
l 
— A 
t 
> = 
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2 3 -1 -2 4 -1 
-.[0 -1 0 5 -1 4 

0. =2 0 10 -2 2 

X1 X3 X2 X4 xs 

2 3 -1 -2 4 -1 
-|[0 -1 0 5 -1 4 

0 0 0 0 0 -6 


X1 B X X4 X5 


The last row corresponds to the equation 
0x + O-x3 + 0x; + O-x4 + 0xs = —6 


Thus, the system has no solution (is inconsistent). @ 

5. Consequence of Gaussian elimination for homogeneous 
systems. 

Remember that the system is said to be homogeneous if all 
its constant terms are zero, namely, 


auxi + Q12%2 + o... + AinXn = O 
Q21X1 + d2220 + ... + AnXn = 0 
GmiX1 + dm2X2 + ... + AmnXn = 0 


A homogeneous system is always consistent: one of its solu- 
tions is of the form x, = 0, x2 = 0, ..., Xan = 0 and is called 
the trivial solution. When considering a homogeneous system, 
it is important to establish whether it has at least one nontrivial 
(nonvanishing) solution. 

Theorem. A homogeneous system in which the number of 
equations is smaller than the number of unknowns (m « n) 
always has a nontrivial solution. 

a When solving the system by Gaussian elimination, we can 
never obtain an equation of the form 0-x%1 + ... + 0x, = b 
with b # 0, since all the constant terms in the system are zero. 
Thus, the process ends with a system in the echelon form in 
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which the number of equations is still smaller than the number 
of unknowns. But such systems have infinitely many solutions, 
nontrivial ones included. 


Exercises to Chapter 9 


1. What is a solution of a system of linear equations in n unknowns xi, 
X24. 5 903. Xn 

2. Find all the solutions of the equation 2x, = 3 if it is in (a) one unknown 
xi and (2) n unknowns xi, X2, ... , Xs. 

3. Solve the following systems of equations by Gaussian elimination. Find 
whether the system is consistent. 1f the system is consistent, reduce it to the 
echelon form. If there are free unknowns, use them to express the remaining 
unknowns. 


Xi t 2X2 x324 x+2y+2z=1 
(H) (3x1 — 5x2 + 3x3 = 1 Q) 2x+ y-2z=2 
2n + 7x2 - x3 = 8 2x+2y+ z2=3 
5x+4y+7z=6 
2 = y+ z= -2 X — 2x2. + 3x3 — 4x = 0 
(3) x+2y+3z= -l (4) 2x1 + X:— x20 
x—-3y-2z2-23 -3xi t x; X3 —-2%=0 


(5) 


Xi — 2x2 + Xs — x4 = -l 


M—-—-2m+%3+ x,—-1 
xy—2x; + X3 534-5 


Xi + 2x2. + 3x3 x = —-4 


x+ y-3z=-l 
2x+ yr 2z 
x+ yt z 
x + 2y -3z 


(10) 


li] 
- w = 


(3) 


M+ X2 + 2g + 3%= 1 
o) 3x1— x2— x3— 2x = -4 
2x) + 3X4— 43 - u= -6 


2x1 — 
4x, — 2x2 + 5xs + X, + 7X5 
2X 


x+y+z+t=1 

x+y-z-t=1 

(6) 
x—-ytz-tz-l 


x-y-ztt=l 


2a- R+ g- 
2x1 — X 
3x1 - a+ u= 
2x1 + 2x2 — 2x3 + 5x, = 0 


(8) 


X2 + 3x3 — 2x4 + 4x5 = -1 


ll 
- b 


X) X) 8x. + 2x5 
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4. Find a for which the system is consistent and solve it for those a: 
1 
2 


GQ) [x+ y+ z=1 (2) (2m + m+ G+ X 
x+ay+ z-21] Xi + 2X3 — X + 4x4 
x+ ytaz=l1 Xi + 7x2 — 4x3 + 11x 


(3) (5x1 — x2 + 2x3 + X, = 2a 
2x1 + x2 + 4x3 — 2x4 = 1 
xı — 3x2 — 6x3 + 5x, = 0 


5. Find the equation of a sphere in space passing through the points (a) 
Mil, 0, 0), M2(1, l, 0), M,(1, l, 1), M,(0, l; 1), (b) Mı(2, 2, 1), M2(2, l, 2), 
M(l, 2, 2), M,(0, 0, 3), (c) M3, l, 1), M2(3, 0, 2), M;(2, l, 2), M,(, =l; 3). 

6. Find the equation of a third-degree parabola passing through the points 
Mil, 0), M2(0, —1), Mi( - 1, -2), Ms(2, 7). 

Remark. A third-order parabola is defined by the equation y = ax + 
bx! + cx +d. 


Chapter 10 


VECTOR SPACES 


10.1. ARITHMETIC VECTORS AND OPERATIONS WITH THEM 


1. Arithmetic n-dimensional vector space. 
Definition. Any finite set of numbers 


1, Q2, ... , An (1) 


is called an arithmetic vector, and the numbers are the coor- 
dinates of this vector. 

We use the notation (di, a2, ..., a4) for an arithmetic 
vector. 

For instance, ( —1, 2, 0, 7) is an arithmetic vector whose 
coordinates are —1, 2, 0, 7. 

We can interpret geometrically only arithmetic vectors having 
one, two, or three coordinates, namely, each set of the form 
(ai), (a1, 32), OF (a1, a2, a3) is associated with a “true” vec- 
tor, i.e. a directed line segment on a straight line, in the plane, 
or in space (in a Cartesian coordinate system). An arithmetic 
vector having more than three coordinates cannot be interpret- 
ed geometrically, since by definition such a vector is a purely 
arithmetic object (the set in (1)). 

Arithmetic vectors are often encountered in mathematics 
with different numbers of coordinates. Here are several ex- 
amples: 

(a) an ordinary (geometric) vector on a straight line can be 
considered as an arithmetic vector having one, two, or three 
coordinates, 

(b) each row in an zith-order determinant is an arithmetic 
vector with n coordinates, 

(c) the coefficients of the unknowns in the linear equation 


Q1X1 + aX  ... t AnXn = D 
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form an arithmetic vector (di, d2, ..., an} having n coor- 


dinates, any solution (x, x}, ... , x?) of this equation is an 
arithmetic vector with n coordinates. 
Definition. Two arithmetic vectors (di, a2, ... , Gn) and 


(ar, a2, ..., an ) are said to be equal if and only if they have 
the same number of coordinates (n = m) and if the respective 
coordinates are equal, a; = aj, a2 = až, ..., Qn =Qn. 

In what follows we shall omit the word “arithmetic” and sim- 
ply say “vector”, and use lower-case letters in bold face to 
denote vectors. If two vectors a and b are equal, then we write 
a=b. 

Definition. The sum of two vectors having the same number 
of coordinates a = (4,42, ... , anyand b = (bi, bo, ... , De) 
is the vector a + b defined by the equation 


a + b= (a, + bı, a2 + bo, ..., An + bn?) 


For instance, (—1, 0, 7, 5 + <4, 1, -—7, 3) = (3, 1, 0, 8). 

Thus, to add vectors means to add their respective coor- 
dinates. This definition is in line with that given earlier for the 
addition of vectors in a three-dimensional space (the coor- 
dinates of the sum of two vectors are equal to the sums of the 
corresponding coordinates of the vectors). 

Definition. The product of the vector a = (a1, a2, ... , Qn) 
by a scalar k is the vector ka defined by the equation 


ka = (kay, Kao, ..., Kan) 


Vector addition is commutative and associative: 


at+b=bta 
(a+b)+c=a+ (b+ 0) 


We can also easily verify the following equations: 
(k + Da = ka + la 
k(a + b) = ka + kb 
k(la) = (kDa 
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Definition. A vector whose all coordinates are zero is called 
the zero vector and is denoted by 0. 

Strictly speaking, such a notation is inaccurate since 0 means 
any of the vectors (05, <0, 05, <0, 0, 05, ... . However, in what 
follows it is clear how many coordinates the vector 0 has in 
each specific case. 

Obviously, a + 0 = a whatever the vector a having the same 
number of coordinates as the vector 0. 

The vector ( —1)a is the negative (or the opposite vector) of 


a and denoted by —a. If a= (aj, ao, ..., Qn), then 
—a = (-@1, —@, ..., —d,). Hence 
a+(-a)=0 
Note that if ka = 0, then either k = 0 or a = 0. 
o Given a = (081,405, ...,@n), the equation ka = 0 means 
that 
kay = 0, ka = 0, ..., kan = 0 (2) 


If k = 0, then there is nothing to be proved. If k z 0, then we 
have a; = 0, @ = 0, ..., @, = 0, whence a = 0. m 

Definition. The set of all arithmetic vectors having the given 
number n of coordinates in which vector addition and multipli- 
cation by a scalar are defined as above is called an arithmetic 
n-dimensional vector space and denoted by R". 

We shall omit the word “arithmetic” and simply say an 
n-dimensional vector space R" or, in short, an R" space. 

We can interpret geometrically only the spaces R!, R?, and 
R?; R! can be associated with the set of vectors on a straight 
line, R? with the set of vectors in the plane, and R? with the 
set of vectors in space. 

2. The set of all solutions of a homogeneous system of linear 
equations as a subspace of R”. 

The theory of arithmetic vectors allows one to interpret a 
number of facts concerning systems of linear equations. We 
take a homogeneous system of linear equations as an example. 

In the notation adopted in Chapter 9, the general form of 
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a homogeneous system of m linear equations in n unknowns is 


1X1 + di2X2 + ... + AinXn = 9 

Q21X1 + Q22X2 + ... + GonXn = 0 
(3) 

Am1X1 + am2X2 + ... + AnnXn = 0 
Each solution of (3) is a set of n numbers xi, X2, ... , Xn and 
can therefore be considered as a vector x from R": x = (xi, 
X2, ... , Xn). In this sense we can speak about the sum of two 


solutions and the product of a solution and a scalar. 

The set of all solutions of a homogeneous system possesses 
two specific properties. 

1°. The sum of two solutions is a solution. 

2°. The product of a solution and a scalar is a solution. 

Let us prove the first property. Suppose e = (a1, o2, ..., 
an) and B = (81, 82, ... , Bn) are two solutions of system (3). 
We have 


a+B = (a1 + B1, a2 + fo, ..., ant Bnd 


We verify whether œ + £ is a solution of (3). We should substi- 
tute the coordinates of the vector a + 8 for the unknowns into 
the left-hand side of each equation of the system to ascertain 
that the result is zero. Take, for example, the first equation in 
(3). We have 


ai(a1 + B1) + ai2(a2 + B2) + ... + Gin(an + Bn) 
= (a1101 + d1202 + ... + Ginn) 
+ (afi + ai282 + ... + GinBn) 


the first sum on the right-hand side is zero since æ is a solution 
and the second sum is zero since f is a solution. 

Let us prove property 2°. Suppose the vector œ = (a1, 
02, ..., Qn) iS a solution of (3). We must prove that the 
vector i 

ka = (kai, kaz, ..., Kan) 


where k is a scalar, is also a solution. Substituting the coor- 
dinates of this vector for the unknowns into, say, the first equa- 
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tion in (3) yields 


aii: Koi + djz:Kog + ... + adis Kon 
= k(ady101 + d1202 +... + Ginn) = K-0-0 


Similarly, we can prove this for the remaining equations. 

We denote by M the set of all solutions of system (3). This 
is a set of vectors in R”. According to what we have proved, 
the set M possesses the following properties. 

1°. If vectors x and y belong to M, then their sum x + y also 
belongs to M. 

2°. If a vector x belongs to M, then the vector kx, with k 
a scalar, also belongs to M. 

Any set of vectors in R" possessing these properties is called 
a subspace. Thus, we can say that the set of all solutions of 
the homogeneous system (3) is a subspace of vector space R". 


10.2. LINEAR DEPENDENCE OF VECTORS 


1. Linear combination of several vectors. 

Concrete problems deal, as a rule, with a set or system of 
vectors rather than with a single vector (all vectors in a system 
have the same number of coordinates). In this case the vectors 
are denoted by the same letter with different subscripts. For 


instance, 
a= (1, 0, 3, =2, -1), a= (-1, l, 4, 3, 05, (1) 
83 = (—-5, 3, 5, 3, 7) 


is a system of three vectors each of which has five coordinates, 
that is belongs to the space R°, while 


bi = (1, 1, 1, 1), be = <1, 1, 1, 1) (2) 
is the system of two véctors belonging to R*. Do not be con- 
fused to see that b, = b; since a system of vectors may have 


repetitions. 
Suppose 


ai, 82, ... , ap (3) 
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is a system of vectors from R”. We choose arbitrary numbers 
ki, Ko, ... , Kp and compose the vector 


a = kiai + kma +... + Kpap (4) 


Definition. Any vector a of form (4) is a linear combination 
of vectors ai, @,...,apandki, k2, ... , Kp are its coefficients. 

Example. Find the linear combination 2a; — 3a; + a3 of vec- 
tors a1, a, and a3 from (1). 

O Adding the vectors 2a; = (2, 0, 6, —4}, —3a; = (3, -3, 
—-12, —9), a3 = (—5, 3, 5, 3) yields 


2a, — 3a: + a3 = <0, 0, -1, 10) e 


Instead of saying “the vector a is a linear combination of 


vectors 81, 82, ... , ap", we can say that “a is linearly expressed 
in terms of the vectors a1, 82, ... , ap" or “a decomposes into 
vectors a1, 32, ... , ap”. 


2. Linear dependence and independence of a system of 
vectors. 

One of the basic concepts of vector space theory is that of 
linear dependence. 

Definition. The vectors a1, 32, ... , ap are said to be linearly 
dependent or form a linearly dependent system if there are 
numbers cj, C2, ... , Cp not all zero such that 

? C181 + (2a + ... + C985 = 0 (5) 
Otherwise, ie. when (5) is only valid for c. =@ =... = 
Cp = 0, we say that the vectors a1, a2, ... , ap are linearly in- 
dependent or the system of these vectors is linearly in- 
dependent. 

Let us stress that the concepts of linear dependence and 
linear independence only refer to systems with finite numbers 
of vectors. 

Theorem. A system consisting of one vector is linearly de- 
pendent if and only if the vector is zero. A system consisting 
of several vectors is linearly dependent if and only if at least 
one of the vectors can be represented as a linear combination 
of the others. 
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Suppose a system consists of one vector a,. If the system 
is linearly dependent, then there is a nonzero number c, such 
that cra; = 0. Since c, z 0, this equality is equivalent to 
a, = 0. Thus, the first statement is proved. Let us prove the 
second statement. 

Suppose the system 


81, $9 ..., app p> 1 (6) 
is linearly dependent. This means, by definition, that there are 
numbers ci, C», ... , Cp, not all zero, such that 

C181 + C982 +... + Cpap = 0 


Suppose c; # 0. Adding the vector — cia; to the last equation 
yields 
—€18, = C982 + ... + Cpa 


and multiplying this equation by —1/c, gives 


c c 
a = (- S) a + elt (- 2)» 
C1 C1 


Thus, we have shown that if system (6) is linearly dependent, 
one of its vectors can be expressed as a linear combination of 
the others. Let us prove the converse. 

Suppose one of the vectors of (6) can be represented as a 
linear combination of the others, for the sake of clarity, let it 
be the vector ai: 


a = kaa +... + Kpap 


where k2, ... , kp are numbers some of which or all may be 
zero. We rewrite the equation as 


(-Iai + k2a2  ... + kpap = 0 


that is, add the vector —aı. This equation has the form as in 
(5), where ci = 71,622 k, ..., œ = Kp, with at least one 
of the numbers ci, ... , cp nonzero (in our case c; x 0). Thus, 
system (6) is linearly dependent. m 

Remark. The assertion in the theorem “there is a vector 
which can be represented as a linear combination of the others” 
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does not mean that every vector can be expressed in terms of 
the others. We illustrate this by way of an example. Let a be 
a nonzero vector. We consider a system of two vectors a, = 0 
and az = a. The system is linearly dependent since, for instance, 
1:a; + 0:a2 = 0 (cı = 1 is thus nonzero). Consequently, one of 
the vectors must be a linear combination of the other. In this 
case a; can be expressed in terms of a? since a; = 0-a2, while 
a) cannot be expressed in terms of a, since a; # 0. 

Generally speaking, it is not so easy to establish whether a 
system of vectors is linearly dependent (or independent). 
However, it is easy for a system of two vectors. Indeed, linear 
dependence of the system a;, az means that one of the vectors, 
say a1, can be expressed in terms of the other, ie. a; = kaz. 
Two vectors, say a; and a2, which form a linearly dependent 
system, are collinear. In this case the coordinates of a; are 
proportional to the corresponding coordinates of a2. For exam- 
ple, the vector a; = (— 1, 0, 3, 4) is collinear with the vector 
a: = (—5, 0, 15, 20). 

Thus, a system of two vectors is linearly dependent if and 
only if the two vectors are collinear. 

3. Geometric interpretation of linear dependence for a sys- 
tem of three vectors in R°. 

We introduce a Cartesian coordinate system in ordinary 
space. Then every arithmetic vector from R?, i.e. every set of 
three numbers (a1, a2, @3) can be associated with an ordinary 
vector, ie. a directed line segment a in the customary space; 
for clarity we assume that a originates from a fixed point O, 
the origin. This geometric vector can be considered to be the 
image of the arithmetic vector. We denote both of them by a. 

To explain geometrically linear dependence of a system of 
three vectors a, b, c in R? we use the lemma from Sec. 3.10, 
which can be formulated thus: A system of three vectors in 
R? is linearly dependent if and only if the vectors are coplanar. 

Example 1. Find whether the system of the three vectors 


a-(-107,2, 3 
b = (2, 0, -14, —4, -6) 
in R’ is linearly dependent. 


15—366 
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© The coordinates of b are proportional to the correspond- 
ing coordinates of a: 


2:(-1) = 0:0 = (-14):7 = (-4):2 = (-69:3 


Thus, the system a, b is linearly dependent (b = 2a). @ 

Example 2. Find whether the system of the three vectors 
a, = (2, 3, 5), a2 = (—4,5, 7), a3 = (10, —7, —9) is linearly 
dependent. 

O A system of three vectors R? is linearly dependent if the 
vectors are coplanar, and in turn, according to the theorem in 
Sec. 3.10, the vectors are coplanar if and only if the determinant 
formed from their coordinates is zero. We have the determinant 


p 3 5 
-4 5 T 
10 -7 -9 


which is zero (check this). Thus, a), a2, and a; are linearly 
dependent. @ 

Example 3. Find whether the system of the three vectors 
a; = <1, 2, 3, 0), a2 = (-1,03, —2), a; = (-13, 12, - 5) 
is linearly dependent. If the answer is affirmative, set up an 
expression relating the vectors. 

© According to the definition of linear dependence, we 
must find whether the equation 


X131 + X232 + X333 = 0 (7) 


in three unknowns x; , X, X; has at least one nontrivial solution. 
We write (7) using vector coordinates 


xil, 2, 3, 0) + x€-1, 0, 3, -2) 
*x-1, 3; 12, m5) = <0, 0, 0, 0) 


Reducing to the echelon form yields 
(xrl + x(71) + x3°(-1), x2 + x»0 + x3°3, 
Xp3 + x3 x3: 12, 
x1:0+ X2°(-— 2) + xX:(— 5) = (0, 0, 0, 0) 


Ch. 10. Vector Spaces 227 


Since the equality of two vectors means the equality of their 
respective coordinates, the last vector equation is equivalent to 
the system of four numerical equations 


Xi— xR- X420 
2xi + 3x3 = 0 (8) 
3x, + 3x2 + 12x3 = 0 
-—2x%.- $x; =0 
We solve (8) using Gaussian elimination: 
X=] X3 -— X320 Xp = X2— x3 = 0 
2x1 + 3x3 = 9 2x% + 5x3=0 


3x1 + 3x2 + 12x = 0 6x) + 15x3 = 0 
— 2x5 — $x3 = 0 —2x, — $x3 = 0 


ES xl- x2—x3-20 
2x2 + 5x3 = 0 


The unknown x; in the last system is free (it may take any nu- 
merical value). Hence (8) has nontrivial solutions. For instance, 


putting xs = —2 yields the solution xı = 3, x2 = 5, x = —2. 
Consequently, ai, a2, and a; are related as 
3a, + Sa, — 2a; = 0 (9) 


Thus, the system a1, a2, a3 is linearly dependent and its vec- 
tors are related by (9). @ 


10.3. PROPERTIES OF LINEAR DEPENDENCE 
We consider the system of vectors 
81, 82, ..., ap (1) 
from R”. 
1°. If one of the vectors is the zero vector, then the system 
is linearly dependent. 


n Suppose, for instance, that the first vector in (1) is a non- 
zero vector. Then obviously 


0 = 0-a2 + 0:233 +... + O-ap 
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which shows that a; is a linear combination of the other vec- 
tors, that is, system (1) is linearly dependent. m 

2°. If a part of a system is linearly dependent, then the entire 
system is linearly dependent. 

o Suppose a part of system (1), say, the first k vectors 
(k < p), is linearly dependent. This means that there are num- 
bers ci, @2, ... , cx not all zero such that 


C13; + C232 + ... + cea, = 0 
which implies that 
cay +... + CBk + ÜO-acu i +... + O0-a, = 0 


ie. the entire system (1) is linearly dependent. m 

3°. If the linearly independent system (1) becomes linearly 
dependent, once a vector a is added, then a can be represented 
as a linear combination of the vectors of (1). 

o The linear dependence of the system a1, ... , ap, a means 
there is an equation 


Cia; + ... + Cpap + ca- (2) 


where ci, C2, ... , Cp, € are not all zero. Note that the coeffi- 
cient of a in (2) is necessarily nonzero; otherwise, (2) would 
imply that the vectors ai, ... , ap are linearly dependent. Since 
€ # 0, a can be represented as a linear combination of ai, ... , 
ap. E 

1. The echelon form of a system of vectors. 

Another property of linear dependence concerns a system of 
vectors which has a special structure. We say that such systems 
have the echelon form. 

Definition. A system of vectors 


a, = (an, a2, vey Op, «++ ys ain?) 

a2 = (0,`~ an, Ses Alps ses n? (3) 
s 

»* wc o» 4 * E 3€ * X ox LIII 

ap — (0, 0. oy lpp; » apm?) 


is said to have the echelon form. 
Here the coordinates below the dashed “diagonal” are all 
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zero, while the coordinates next to the diagonal and above it, 
ie. Qi1, G22, ... , App, are all nonzero. 

Obviously, p < n for the echelon form of a system, that is, 
the number of vectors does not exceed the number of vector 
coordinates. 

4°. A system of vectors having the echelon form is linearly 
independent. 

o Let us find when a vector of the form 


a = ca, + C282 + ... + Cpap 


is the zero vector. Using (3) we find that the first coordinate 
of a is cd11; if it is zero, then cı = 0 since a1; # 0. Then the 
term cia; in the equation for a vanishes and the second coor- 
dinate of a is c2422; if it is also zero, then c2 = 0 since a22 # 0, 
and so on. Thus, we infer that 


Cia; + C282 + ... + Cpap = 0 


is valid if c1 = c2 = ... = Cp = 0, which implies that system 
(3) is linearly independent. m f 

Example. Show that the vectors in R’ a = 
a: = (-1, 0, 2, 3, — 5), and a; = <0, 0, -2, 
independent. 

O Arranging the vectors in the following order: a? = (—1, 
0, 2,3, —5), a1 = (06,3, 4, —2, 1), a = (0,0, —2, 1, 15 yields 
the echelon form of a system of vectors. @ 

2. The number of vectors in a linearly independent system. 

The corollary follows from property 4°: there is a linearly 
independent system in R" consisting of n vectors exactly. 

An example is the echelon form of a system of vectors 


ei = 0,0, ..., 0) 


«6, =2, 1), 
1, je linearly 


(4) 


one of whose coordinates is 1 and all the others are 0. Any 
subsystem of it is also linearly independent, otherwise, the 
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linear dependence of the subsystem would imply that the sys- 
tem is linearly dependent. 

Naturally we would like to know whether there is a linearly 
- independent system of more than n vectors in R”. In the next 
subsection we show that any system in R" consisting of more 
than 7 vectors is linearly dependent. 

We mention in conclusion an important property of the sys- 
tem ej, eo, ... , e, (see (4), namely, any vector a from R” can 
be represented as a linear combination of the vectors e;, 
€2, ... , €x. In fact, suppose we have vector a = (a1, à», ... , 
an}. Multiplying ei by ai, e2 by a2, and so on and adding the 
resulting vectors yield 


aye; + d2€02 + ... + age 
= (mM, 0, ..2, 0) = (0, a, ..., OO +... + 
«0, 0, cos an) 
= (a, Q2, ..., Mn) =A 


Thus, there is a system of n linearly independent vectors in 
R" in terms of which any vector of R" can be linearly 
represented. 


10.4. BASES IN SPACE R" 


Definition. A linearly independent system of vectors in a 
space R” is called the basis of R” if any vector from R” can 
be represented as a linear combination of any vector of the 
system. 

The last proposition formulated in the previous subsection 
implies that there is a basis composed of n vectors in R”. Other 
bases also exist for R”. j 

Lemma. /f there is a basis of R" composed of p vectors, then 
any p + 1 vectors in R” are linearly dependent. 

a For the sake of simplicity, let p = 2. Thus, we have a basis 
of two vectors a; and a2 in R": we prove that any three vectors 
bı, bz, and b; are linearly dependent. 

We denote the basis vectors by a and b instead of a; and a. 
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Since a and b form a basis, any vector bi, b2, bs can be 
represented in terms of a and b: 


bı = qa + Bib bz. = œa + fb, b3 = a3a + sb 


Multiplying the first equation by a number x (so far ar- 
bitrary), the second equation by x2, and the third equation by 
x3 and adding the results yield 


xibi + xb» + xb; 
= Xi(o18 + Bib)  x(o2a + Bb) + xs(o5a + 63b) 
= (Xo + X202 + X303)a + (0181 + 282 + X383)b 


If we can choose xi, x», X such that not all are zero and both 
expressions in parentheses are zero 


0X1 + 02X2 + 03x; = 0 


(1) 
Bii + Bax + B3x3 = 0, 


then 
xib; + xb; + X3b3 = 0 


is valid, i.e. the system bi, b», b; is linearly dependent. Thus, 
in order to prove the theorem, it is sufficient to show that sys- 
tem (1) has a nontrivial solution. Since system (1) is homogene- 
ous and the number of its equations is smaller than the number 
of the unknowns (two equations in three unknowns), it has non- 
trivial solutions according to the theorem in Sec. 9.2, Item 5. 
Thus, the theorem is proved for the special case of p = 2. 

A similar argument can be used for the general case (for any 
p). Instead of system (1), we have a homogeneous system of 
p equations in p + 1 unknowns which, according to the the- 
orem, has a nontrivial solution. m 

Given the lemma, we can easily prove the following theorems. 

Theorem 1. Any two bases of a space R" have the same num- 
ber of vectors. 

Note that since one of the bases of R”, namely, the basis 
e1, €, ... , €n in the preceding subsection, consists of n vectors, 
any basis of R" has n vectors. 

o Suppose a basis of R” has p vectors and another basis 
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has q vectors. We shall prove this indirectly: let p ¥ q, say 
D < q. Then we can choose p + 1 vectors from the second bas- 
is. According to the lemma, these vectors must be linearly de- 
pendent, and, therefore the second basis is linearly dependent, 
but this contradicts the definition of a basis. m 

Theorem 2. Any system in R" having more than n vectors 
is linearly dependent. 

o A space R” contains a basis of n vectors. According to 
the lemma, any system of n + 1 vectors in R" must be linearly 
dependent. But then, generally, any system of more than n vec- 
tors is linearly dependent, since we can always choose a sub- 
system of n+ 1 vectors and this is linearly dependent 
(according to the lemma). m 

In conclusion let us characterize all the bases of R". 

The bases of R" are different linearly independent systems 
of n vectors. 

Given a basis, its vectors are linearly independent and their 
number is n. Conversely, any linearly independent system of 
n vectors 


81, 82, ... , An (2) 


is a basis of R". To make sure, we add a vector a in R" to system 
(2) and obtain a system of n + 1 vectors which according to 
Theorem 2 must be linearly dependent. By property 3?, Sec. 
10.3, of linear dependence we see that a can be represented as 
a linear combination of ai, a2, ... , an. This proves that the 
system a1, 32, ..., An is a basis of R”. m 

Example 1. The system of vectors a; = (l, 2, 3), a2 = 
(-1,0,3), a = (2,5, —2), a4 = (4, 12, 2) from R? is linearly 
dependent since the number of vectors is greater than three. 

Example 2. Verify whether the vectors a; = (1, 2, 3), 
az = (0, 1, —1), a3 = (2, 4, 5) form a basis in R?. 

O Since we have three vectors, it only remains to be estab- 
lished whether they are linearly independent or, which is the 
same, whether the equation 


X181 + X232 + X3a3 = 0 
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has a trivial solution (x1 = x? = xs = 0). This equation is 
equivalent to the system 

X1 + 2x3 = 0 

2X1 + X2 + 4x; = 0 

3x1 — x; + 5x; = 0 
According to the theorem in Sec. 3.9, a homogeneous 3 x 3 


system has a nontrivial solution if and only if its determinant 
is zero. The determinant of our system 


1 02 
2 14|-2-1 
34 


is nonzero, thus, the system has only a trivial solution. Hence 
it follows that the vectors a1, a2, and a; form a basis of R?. 6 


10.5. ABSTRACT VECTOR SPACES 


Up till now we studied an arithmetic vector space R" which 
is the set of all arithmetic vectors (a1, @2, ... , Q4) in which 
two algebraic operations, viz., vector addition and the multipli- 
cation of a vector by a scalar, are defined. 

A thorough analysis of the basic theorems on arithmetic vec- 
tors shows that the notation of a vector in the form of a line 
of several numbers is not essential. The important thing is that 
there are two operations: vector addition and multiplication by 
a scalar, which possess certain properties permitting to operate 
on expressions of the form k1aj + kya. + ... + Kpap, where 
81, ... , ap are vectors and ki, ... , kp are scalars and which 
obey ordinary algebraic laws. 

This suggests a principle for generalizing the notion of an 
arithmetic vector. We should call a vector any object for which 
two operations, viz., vector addition and multiplication by a 
scalar, are defined, provided the operations obey some natural 
requirements. The advantage of such an approach is that there 
is now no need to consider a vector to be a set of numbers 
and we may call objects of diverse nature, such as functions 
and matrices, vectors. 
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1. Definition of an abstract vector space. 

Definition. A set L is called a vector (or linear space) and 
its elements are called vectors if on this set 

(I) the operation of addition is defined: every pair of vectors 
a and b from L is associated with a third vector from L calied 
the sum of a and b and denoted by a + b, 

(II) the operation of mui.iplication of a vector by a scalar: 
each pair a, k (where a isa vector and k is a scalar) is associated 
in L with a vector called the product of k and a and denoted 
by ka, 

(III) these operations possess the following properties: 

(1) a b = b+ a for any two vectors a and b in L, 

(2) (a + b) + e = a + (b + c) for any three vectors a, b, and 
c, in L, 

(3) there is a unique vector 0 in L such that a + 0 = a for 
any a in L, 

(4) for any vector a there is a unique vector a' such that 
at+a’=0 in L, 

(5) l-a = a for any a in L, 

(6) k1(423) = (ki k2)a for any scalars kı, kz and any a in L, 

(7) (kı + kz)a = kia + kza for any scalars kı, kz and any 
ain L, 

(8) k(a + b) = ka + kb for any scalar k and any vectors a 
and b. 

The vector 0 in (3) is called the zero vector; the vector a’ 
in (4) is called the negative of a and is denoted by —a. 

The branch of mathematics that studies vector spaces is 
called /inear algebra. 

Note that we have considered two sets of two different opera- 
tions but used the same notation for each set, viz., vector addi- 
tion and scalar addition, and multiplication of a vector by a 
scalar and multiplication of scalars. However, this is not umbig- 
uous since it is clear from the notation which operation is in 
use. For instance, given the equation 


(ki + kja = kia + ka 


the + sign on the left-hand side means addition of scalars, 
while the + sign on the right-hand side means vector addition. 
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2. Examples of vector spaces. 

Space R”. We considered this space earlier. Its elements 
(arithmetic vectors) are the sets of n numbers whose addition 
and multiplication by scalars are defined by the rules 


(0, By ..., dn) + (bi, bo, ..., On) 
= (a, + bı, a. + bo, ores Qn + bn), kla, GM, weeny an) 
= (kai, ka, ..., kan) 


Since n can assume infinitely many values 1, 2, 3, ... , we 
have an infinite series of vector spaces: R!, R?, R?, .... 

Function space. We can consider various mappings of the set 
R of real numbers into itself or, which is the same, various 
functions 


y = f(x) 


defined on R and varying in R. Let us introduce the operations 
of addition and multiplication for functions. To add two func- 
tions fi(x) and f2(x) together is to construct a new function 
AGO + f(x) whose value for any x = a is fi(a) + f(a), mul- 
tiplication of f(x) by a scalar k yields Kf(x) whose value for 
any x = a is kf(a). 

The set of all functions on which the operations of addition 
and multiplication by a scalar are defined is a vector space. 

Space of matrices. We consider various matrices of the form 


[^ 2 

a2; 22 

having two rows and two columns. Addition of matrices and 
multiplication of a matrix by a scalar are defined as follows: 


(e "i 4 (o _ a tbu az + a 
ü 2 ba bz an + Dn G22 + bz 
«(m 2 C e i) 
Q3 anz kanı kaz 
We can easily verify that all the conditions in the definition 
of a vector space are satisfied here. Thus, we have a vector space 
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which is the space of 2 x 2 matrices (i.e. of matrices having 
two rows and two columns). 

Vector spaces of 3 x 3, 4 x 4, etc. matrices can be construct- 
ed in a similar way. Vector spaces of m x n matrices, where 
m x n, also exist. 

3. Basis. 

We can introduce a number of concepts for an abstract vector 
space in the same way as we did for the space R". Specifically, 
we can speak of a linear combination 


kya; + K232 + ... + Kpap 


of several vectors a1, a2, ... , ap and of linearly dependent and 
linearly independent systems of vectors. However, the proper- 
ties of L may differ from those of R" in one respect. In order 
to formulate this (possible) difference, we introduce the impor- 
tant concept of a basis of the space L, which actually repeats 
the definition of a basis of R". 

Definition. A basis of a vector space L is a linearly indepen- 
dent system of vectors from L such that any vector from L can 
be represented as a linear combination of the system's vectors. 

A basis exists in R", an example being the system of 7 vectors 
€1, 62, ... , e constructed in Sec. 10.3, Item 2. There are vector 
spaces L which have no basis. In order to exclude such spaces 
from our consideration, we must add one more condition to 
(I), (ID, and (III) in the definition of a vector space given in 
Item 1: 

(IV) the space L has at least one basis. 

4. The coordinates of a vector in terms of a basis. 

Let us choose a basis of L consisting of n vectors 


Èi, €2, ... 5 On (1) 


Any vector a from L can be expressed in terms of the basis, 
ie. represented as a linear combination of the basis vectors 


a = kiei + Koe + ... + Knen (2) 


Theorem. Any vector a can be expressed in terms of the basis 
in a unique manner. 
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o Suppose we have another representation of a in the basis 
besides (2): 


a = kje, + Ke +... + knen (3) 
Subtracting (2) from (3) yields 
(ki — kijer + (kd — kıje2 + ... + (kn — Kn)€en = 0 


But by assumption the vectors ei, €2, ..., e, are linearly in- 
dependent since they form the basis. Consequently, the last 
equation can only be valid if all the coefficients ki — ki, 
ki — k2, ... , kn — Kn are zero, that is, 


ki = ki, kz = ki, pony Kn = kn 


Hence, representations (2) and (3) are the same. m 
Definition. The coefficients ki, k2, ... , Ka in the representa- 
tion of the vector a in terms of the basis (1) are called the coor- 
dinates of a in the basis. 
Equations 


(kiei + Kaeo + ... + Kn€n) + (lei + bez +... + les) 


- (a T h)ei * (ko + b)e; s usen (Kn T In)en 
and 
c(kiei + koe) + ... + Knen) 


= (cki)ei + (ckz2)e2 + ... + (CKn)en 


following from the properties of vector addition and multipli- 
cation of a vector by a scalar show that in order to add vectors, 
we add their respective coordinates in the given basis, and when 
multiplying a vector by a scalar we multiply its coordinates in 
the given basis by that scalar. 

Example 1. The vectors a and b have the coordinates in a 
basis of L —1, 2, 3, 0, 6 and 1, 1, —3, 0, —4, respectively. 
Find the coordinates of the vector 2a — 3b in the same basis. 

© The coordinates of the vector 2a are — 2, 4, 6, 0, 12 and 
those of —3b are —3, —3, 9, 0, 12. Hence it follows that the 
coordinates of 2a — 3b are —5, 1, 15, 0, 24. @ 
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Example 2. Verify that the arithmetic vectors a; = (1, 2, 3), 
a: = (0, 1, -1), a3 = Q2, 4, 5) form a basis of R? and find 
the coordinates of the vector a = (5, 13, 9) in that basis. 

O We showed in Sec. 9.4 that the system ay, a2, a3 is a basis 
of R°. Thus it remains for us to find the coordinates of a in 
terms of the basis a;, a2, a3. 

By denoting the desired coordinates by xi, x2, xs, we can 
write 

X18, t X282 + X333 =a (4) 


By assumption we have 
X181 + X282 + X383 
= (Xi + 2x3, 2X1 + X» + 4x3, 3x1 — x2 + 5x3) 


and therefore (4) is equivalent to the system 


X1 + 2x3 = 5 
2x1 + X t 4x3 = 13 
3x1 -Xx + 5x3 =9 
Gaussian elimination yields 
Xi + 2x3 = § 
2 = 
—%2 -— Xj = -6 
whence it follows that x.=3, x3=3, x = —l. Thus, 
a = —8; + 3a2 + 3a3 whose coordinates in the basis a;, a2, a; 


are —1, 3, 3. 

5. Representation of an abstract vector space L as an arith- 
metic vector space R”. 

Suppose a vector space E has a basis consisting of n vectors 
ei, €2, ... , €n. By representing a vector a from L as a linear 
combination of the vectors of the basis: 


a = kiei + loei... + Knen 


we can associate a with a line of numbers Ki, k2, ... , Ky which 
are the coordinates of a in the basis ej, e, ... , en, thereby, 
associating a with an arithmetic vector from R". 
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The properties of the coordinates of a vector in terms of the 
basis imply that when vectors from L are added, their cor- 
responding vectors from R" are also added, and when a vector 
from L is multiplied by a scalar, its corresponding arithmetic 
vector is also multiplied by that number. Thus, we can say that 
under juxtaposition 


a (ki, ko, ECC RS | Kn) 


the space L transforms into an arithmetic vector space R", R" 
being a “numerical model" of L. Thus, everything we proved 
earlier for R" must be valid for L. Specifically, 

(1) any two bases of L contain the same number of vectors, 

(2) if a basis of L consists of n vectors, then any system of 
more than n vectors in L is linearly dependent. 

6. Dimensions of a vector space. 

Definition. The number of vectors of a basis of a vector 
space L is called the dimension of L. If the number of vectors 
is n, then L is said to have dimension n (or to be n- 
dimensional). 

If R" contains a basis of n vectors, then R" has dimension 
n or is said to be an arithmetic n-space. We know that any sys- 
tem of n + 1 or more vectors in R" is linearly dependent and 
we can give another definition of dimension: the dimension of 
a space L is the maximal number of linearly independent vec- 
tors in L. 


Exercises to Chapter 10 


1. Find the linear combination 2a; — a2 + 3a3 of the vectors a; = (1, —1, 
0, 45, a2 = (16, 4, 7, —2), and a3 = (5, 2, 2, —3). 

2. Find the vector 5a; — 222 + 7a; if a: = b; — 2b2, a2 = 3b; + bo, 
a3 = —b + b; + bi and bi = (0,0, 1), b2 = ¢-I, 2, 3), bs = (-1, 1, 0). 

3. Find the vector x from the equation a + 2b + 3c + 4x = 0, where 
a= (5, —8, —1), b = (2, -1, —4), c = (-3, 2, - 5). 

4. Find the vector x from the equation 3(a — x) + 2(b + x) = 5(c + x), 
where a = (2, 5, 1, 3), b = (10, 1,-5, 10), c = (4, 1, -1, 1). 

5. Show without calculation that each system of vectors is linearly de- 
pendent: 

(a) a 

(bai 


(1, 1, 0, 1), a2 = (7-3, -3, 0, - 35, 
(l, 2, 3, 4), a2 = (4, 3, 2, 1), as = (5, 5, 5, 5), 
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(c) a = <l, 2, 3>, a2 = (3, 2, 1, a = (l, 1, 1). 

6. Show without calculation that the system of vectors is linearly in- 
dependent: 

(a) a1 = (1, 0, 0, 0), à? = (0, 1, 0, 0), a3 = (0, 0, 1, 0), ag = (2, 3, 4, 5). 

(b) à; = (0, 2, 55, a = (0, 0, 3), a = (—-1, 4, 7). 

7. Given the two vectors a; = (l, 2, 3, 4), and a? = (0, 0, 0, 1), choose, 
without calculating, two more vectors a; and a4 such that the system a), a2, 
33, a is linearly independent. 

8. Find all the values of A such that the vector b can be represented as a 
linear combination of a; and az: 


a = (3, -1) a; = (1, 0, 0) a, = (3, 4, 2) 
(1) a2 = (5, -2) (2) a = (0, 1,0) (32 = (6, 8, 7) 
b = (2, X b = (0,0, X) b = (9, 12, X) 


9. Find whether each system is linearly dependent: 

(a) a = (l, 1, 1), a = (1, 2, 3), a = <l, 3, 3), 

(Da = (1, 1,1, 1), a2 = (1, 21,1, 15,3: = (2,3, 1,4), a = 2, 1, 1,3), 

(c) a; = (2, 3, -4, 1), a = (1, —2, 4, 0), as = (0, 0, 5, 1). 

10. Two vectors a; and a; have the coordinates — 1, 2 and 3, — 5, respectively, 
in a basis of a two-dimensional vector space L, and the coordinates of vector 
b are 1, 1. Show that a; and az constitute a basis and find the coordinates 
of b in this basis. 

11. Three vectors ai, a2, and a; have the coordinates 1, —1, 0; 1, 2, 3; and 
0, 1, —1 respectively in a basis of a three-dimensional vector space L, and 
the coordinates of vector b are 6, 6, 6. Show that a1, a2, a3 constitute a basis 
and find the coordinates of b in this basis. 


Chapter 11 


MATRICES 


We already used matrices in Chapter 3 when considering the 
determinants of square matrices and Chapter 9 when solving 
systems of linear equations by Gaussian elimination. In fact, 
matrices are extensively applied in science and engineering. 

We recall what a matrix is. 

Definition. An m x n matrix is a set of numbers arranged 
in a rectangular array of m rows and n columns and usually 
enclosed in parentheses. 

For instance, 

0 -13 
2 5 3) 
is the 2 x 3 matrix. 

An n x n matrix, which has the same number of rows and 
columns is called a square matrix of order n. 

The numbers constituting a matrix are called the elements 
of the matrix. 

We use the usual notation and denote the element in the ith 


row and jth column by aj;. Thus, the general form of an m x n 
matrix is 


aii — i2 Qin 
a21 02 “an 
Gmi Am2 Amn 


We shall denote a matrix by a single upper-case letter in bold 
face, A, B, C, and so on. 


16— 366 
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11.1. RANK OF A MATRIX 


1. Definition of the rank of a matrix. 
Given an m x n matrix A 


ai — di? ain 
A=] a an 2n 
Ami Am Amn 


Each row of A can be considered as an arithmetic vector having 
n coordinates, ie. a vector from R”. We denote these vectors 
by Ai, Ao, ..., Am: 

Ai = (dii, 12, ..., Gin? 

A2 = (zi, G22, ..., Aen) 


Am = (dmi, Am2, ... amn? 


Definition. The rank of the matrix A is the maximum num- 
ber of linearly independent vectors in the system Aj, A2, ..., 
Am of its rows. 

When all the elements of a matrix A are zero, the order of 
A is zero. 

Thus, the rank of a matrix is a number r such that 

(1) the matrix contains r linearly independent rows, 

(2) any r + 1 rows of the matrix are linearly dependent. 

Example. Consider the 3 x 4 matrix 


10 3-2 
A-2[-11.4 3 
21 13 -3 


The first two rows A; = <1, 0, 3, —2) and A; = (- 1, 1, 4, 
3) are linearly independent since the coordinates of A, are not 
proportional to those of A». At the same time the system of 
the rows Aj, A2, As is linearly dependent since A; = 3A, + Az 
(check this). Hence, the rank of A is two. 

2. Two propositions on the rank. 

(1) Suppose any r rows of A are linearly independent. If the 


Ch. 11. Matrices 243 


rank of A is r, then all the remaining rows can be represented 
as a linear combination of the r rows. 

a Adding any of the remaining rows to the r rows yields 
a system of r + ] rows which, by the definition of rank, must 
be linearly dependent. According to property 3°, Sec. 10.2, of 
linear dependence, it follows that the added row can be ex- 
pressed as a linear combination of the r rows. m 

(2) Suppose any r rows of A are linearly independent. If the 
remaining rows can be represented as a linear combination of 
the r rows, the rank of A is r. 

a The proof is based on the concept of a basis of a finite 
system of vectors. 

A basis of a system of vectors a1, a2, ..., ap is a linearly 
independent part of the system (a subsystem) such that any vec- 
tor of the system can be represented as a linear combination 
of the subsystem. The following theorem is valid, and it can 
be proved in the same way as the theorem on bases (Sec. 10.4). 

Theorem. Any two bases of a given system of vectors consist 
of the same number of vectors. 

This theorem helps us prove proposition (2). Suppose we 
have r rows of A, say Ai, A2, ..., A,, such that the assump- 
tions in (2) are satisfied, i.e. they are linearly independent and 
any row of A can be represented as linear combination of the 
r rows. Then, these r rows constitute a basis of the system of 
all the rows of A. Let us consider any r + 1 rows of A. If they 
were linearly independent, this would mean that there is a basis 
of more than r vectors in the system of all rows of A, which, 
according to the above theorem, is impossible. Hence it follows 
that any r + 1 rows of A are linearly dependent, and the rank 
of A isr. m 

3. Properties of the rank of a matrix. 

1°. The rank of a matrix A is not changed, if we add a row 
(say, next to the last row) which is a linear combination of the 
rows of A. 

o We denote the new matrix by A. Let us prove that the 
ranks of A and A are the same. 

Suppose that the rank of A is r. This means that A must 


16* 
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contain r linearly independent rows, say, the first r rows Aj, 
A», ..., Ar. The remaining rows of A can be represented as 
a linear combination of Ai, Ao, ..., A,. This implies that the 
added row can also be expressed as a linear combination of 
A1, A2, ..., Az; by assumption, it is a linear combination of 
the rows of A. In other words, all the rows following the first 
r rows in A can be expressed as a linear combination of Aj, 
A2, ..., Ay. Thus, the rank of A is also r. m 

Property 1 can be formulated in a different way: the rank 
of a matrix does not change if we delete a row which can be 
represented as a linear combination of the remaining rows of 
the matrix. 

o In fact, if we denote the initial matrix by A and the matrix 
obtained by deleting the row by A, we can see that À can be 
obtained from A by adding a row which is a linear combination 
of the rows of A. But in this case, according to property 1°, 
the ranks of A and À must be the same. m 

29. The rank of a matrix is left unchanged if we add a linear 
combination of the other rows to any row. 

o Suppose we add the second row of A multiplied by a 
number K to the first row. The result is the matrix À whose 


rows are Ai + kA2, As», ..., Am (1) 


The rank of A is not changed if we add the row Aj; indeed, 
A; is a linear combination of rows (1), Ai = (Ai + kA) — 
kA»). We obtain a matrix with rows Aj, Ai + KA», Ao, ..., 
Am. The rank of this matrix is again left unchanged if we delete 
its row Ai + KA» (since this row is a linear combination of the 
other rows). The result is a matrix with rows Ay, Ao, ..., Am, 
that is, the initial matrix A. Thus, the ranks of A and A are 
the same, m 

39. The rank of A is not changed if we add a zero row. 

D Azero row can always be considered as a linear combina- 
tion of the rows of A (0 = 0-A, + 0-A; + 0-Am), and there- 
fore property 3? follows from property 19. m 

We can give another formulation of property 3°: the rank 
of A is unaltered if we delete a zero row (of course, provided 
there is such a row). 
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11.2. PRACTICAL METHOD FOR FINDING THE RANK 
OF A MATRIX 


In order to find the rank of a matrix, we can apply Gaussian 
elimination to a given matrix A and not to a system of 
equations. 

Definition. The operations in Gaussian elimination are called 
elementary operations. The elementary operations for matrices 
are 

(1) interchanging two columns, 

(b) deleting a zero row, 

(c) adding one row multiplied by a number to another row. 

The following theorem is valid. 

Theorem. The rank of a matrix is not changed during 
elementary operations. 

a The statement is obvious for elementary operations of 
type (a); it follows from properties 2° and 3° (see Sec. 11.1, 
Item 3) for elementary operations of types (b) and (c). m 

The main idea of the practical method for computing the 
rank of a matrix is that the elementary operations can reduce 
matrix A to the form 


bi bi2 ... birbi,r+1 ... Bin 
B={ 0 bn... bzrbz2,r+1 ... bzn 
0 O she Bebe ice Dm 


in which the diagonal elements bii, b22, ..., brr are nonzero, 
while the elements below the diagonal are zero. This form of 
a matrix is called the echelon form. Then we can immediately 
deduce from the echelon form of matrix that the rank of A is r. 

In fact, the rows of B are linearly independent since they 
form the echelon form of vectors (see Sec. 10.3), and thus the 
rank of B equals the number of its row, i.e. r. Since the rank 
remains unchanged by elementary operations, we know that the 
rank of A is also r. 

Rule. For finding the rank of a matrix A, we should reduce 
it to the echelon form B using elementary operations. The rank 
of A equals the number of rows in B. 
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Example 1. Find the rank of the matrix 


-13 
A [35 
=3 1 
=5 7 


© Since ai, # 0, we can make all elements of the first 
column below a1; vanish. To do this, we multiply the first row 
by —3 and add the result to the second and third rows, and 
then we add the first row multiplied by — 5 to the fourth row. 
The result is the matrix  . 


=I. 3 3 2 5 
0-4 -7 -3 -M 
0 -8 -14 -6 -22 
0 -8 -14 -6 -24 


Noting that 22? #0, we make all elements of the second 
column below a22 vanish. To this end, we add the second row 
multiplied by —2 to the third and fourth rows. We have the 
matrix 


=i 3. 3 2 5 
0-4 -7 -3 -Il 
0 0 0 0 0 
0 0 0 0 =2 


from which we delete the zero row. The third element in the 
last (now the third) row is zero, however, we can make it non- 
zero by interchanging the third and fifth columns. Then we get 


the matrix 
-1 3 $02: 03 
B- 0 -4 -11 -3 -7 
0 4-2 0 0 


which is in echelon form. The rank of B is three; consequently 
the rank of the initial matrix A is also three. @ 
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Example 2. Find the rank of the matrix 


-143 a2 
A={-2751 3 
—1218a 


for different values of the parameter a. 
O We use the elementary operations 


-1 4 3 a 2 
A~ 0-1-11-2a -1 
0-2-28-a a-2 


-] 4 3 a 2 
~ 0 -1 -1 1-2a -1 
0 0 06+3a a 


hence it follows that if at least one of the numbers 6 + 3a or 
a is nonzero, then the rank of A is three, and if both numbers 
are zero, then the rank is two. But a and 6 + 3a cannot both 
be zero simultaneously. Thus, the rank of A is three for any 
a. 0 


11.3. THEOREM ON THE RANK OF A MATRIX 


Suppose we omit some k rows and k columns, k < n and 
k « m, from the matrix 


11 a2 ... Ain 
A=] @1 42 An 
Ami am2 Amn 


The elements at the intersection of these rows and columns 
form a square matrix A’ of order k (a submatrix of A). Its 
determinant |A'| is called a kth-order minor of A. 

Nonzero minors of a matrix are especially interesting. The 
theorem given below relates the rank of a matrix and a 
maximal-order nonzero minor. 

Theorem (on the rank of a matrix). The rank of a matrix 
equals the maximal order of the nonzero minors of the matrix. 
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In other words, if the rank of a matrix A is 7, then A neces- 
sarily contains an rth-order nonzero minor, while all the minors 
of order r + 1 or more are zero. We omit the proof of the 
theorem. 

Corollary 1. The maximum number of linearly independent 
rows equals the maximum number of linearly independent 
columns. 


o In fact, we consider a matrix A and its transpose A* 


ai art amı 
A* = | a2 an Am2 
Ain Q2n Amn 


(A* is obtained by interchanging the rows and columns in A). 
In order to prove the proposition it is sufficient to verify that 


rank A = rank A* (1) 


In fact, the left-hand side of (1) expresses the maximum num- 
ber of linearly independent rows in A, while the right-hand side 
is the maximum number of rows in A*, which is, obviously, 
the maximum number of columns in A. 

Thus, it remains for us to prove (1) which by the theorem 
on the rank of a matrix is equivalent to the following problem: 
prove that the maximal orders of the nonzero minors in the 
matrices A and A* are the same. This last proposition follows 
directly from the properties of determinants (i.e. the transposi- 
tion of a matrix leaves its determinant unchanged), since by 
transposing A we exchange each minor of A by the “trans- 
posed" minor and hence the minor is unaltered. m 

Corollary 2. If the determinant is zero, then its rows are 
linearly dependent. 

O We consider a square matrix A such that |A| = 0. Then 
the maximal order of the nonzero minors must be less than 
n (n being the number of rows), hence, rank A < n. This means 
that the rows of the matrix are linearly dependent. m 
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11.4. RANK OF A MATRIX AND SYSTEMS OF LINEAR 
EQUATIONS 


Suppose we have a system of linear equations 


1X1 + dizX2 9... + AinXn = bı 
G21X1 + d22X2 + ... + AanXn = by (1) 
Ami X1 + Am2X2 + ... + AmnXn = Om 


We arrange the coefficients of the unknowns in matrix form 


a daa Qin 
A=] ay d Q2n 
Ami Am2 Amn 


and this is called the coefficient matrix of (1). 


Suppose system (1) is consistent, then Gaussian elimination 
yields the system in the echelon form 


bixi + bX +... + birXr + bisr+1Xr+1 +... + DinXn = C1 
boxà + ... + barXr + Dare iXrea +... + Dontn = C2 


bX, + brn 41X41 SE usce DnXn = Cr 
Its coefficient matrix is 


bu Dia... Obie s.s Bin 
B= 0 by... b2rbz2,r+1 ET 


System (1) is obtained from (2) by elementary operations. 
Now, applying the same elementary operations to the matrix 
A of (1) yields the matrix B. We know that elementary opera- 
tions do not change the rank of a matrix, thus, the rank of 
A is equal to the rank of B. Since the rank of B equals the 
number of its rows, ie. r we can write 


rank A =r 


250 Part Two. Linear Algebra 


Reading this equation “from right to left” (r rank A) we may 
conclude that if system (1) is consistent, then the number r of 
the equations in an equivalent system (2) obtained from (1) by 
Gaussian elimination, equals the rank of the matrix A of (1). 
Or alternatively, if the system is consistent, then the number 
of free unknowns is n — r, where r is the rank of the matrix 
A of the system. 


11.5. OPERATIONS WITH MATRICES 


The study of operations with matrices is an important part 
of matrix theory. Multiplication of matrices, which is similar 
in certain respects to multiplication of numbers, is especially 
interesting and underlies matrix algebra which is frequently 
used in mathematics and its applications. 

1. Addition of matrices. 

Suppose A and B are two matrices of the same dimension, 
ie. having the same number of rows and the same number of 
columns, 


à dn Qin bu bun Din 
A-2[a a Qs \ B=] b bn ban 
ami Am amn bmi bm Dyn 


ai+ dy a2 + bia ... Ain + Din 
A+Be= 421+ by an + bzn ... on + Dan 
Omi + bmi am + bm Amn + bmn 


In other words, in order to add two m x n matrices we should 
add their corresponding elements. For instance, 


1 2 -1 -2 00 
-1 34+ 1-2})=[01 
4-2 1 3 5 1 
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An m x n matrix with all its elements being zeros is called 
an m x n zero matrix and denoted by 0 (strictly speaking, we 
must denote it by Om,» to show its dimension). Obviously, 


AT0-A 


for any matrix A of the same dimension as 0. 

2. Multiplication of a matrix by a number. 

To multiply a matrix A by a number k is, by definition, to 
multiply all its elements by that number: 


kay kaiz es kain 
kA = kar, kaz dx Kaan 
kami kam2 eae Kamn 


For instance, 
3. ] 1X (3 =3 
0 2 0 6 
It can easily be proved that 
k(A + B) = kA + KB 


for any two matrices A and B of the same dimension. 

3. Matrix multiplication. 

This is a special operation in which any two matrices A and 
B such that the number of columns in A equals the number 
of rows in B yield a third matrix C. 

Suppose we have an m x n matrix A 


au — i2 ain 
A= an 022 ... An 
Ami Am2 Amn 


and an n x k matrix B 
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We note that, by definition, the number 7 of columns in A 
equals the number 7 of rows in B, or in other words, the length 
of a row in A equals the height of a column in B. 

Definition. The product AB of two matrices A and B is a 
matrix C whose elements are such that 


Cy = aubyj + anb +... + inDnj 


The rule for finding cj can easily be remembered: every ele- 
ment of the ith row of A and the corresponding element of 
the jth column of B 

bij 

bij 

Gd ... im | 

bnj 
are multiplied and then the results are added together. Since 
we have m rows in A and k columns in B, the double subscripts 
of cj will run over i from 1 to m and over j from 1 to k, whence 
it follows that C is an m x k matrix. 

To iliystrate, we compile a table showing the numbers of rows 
and columns in the matrices A, B, and AB: 


Number of rows 


Number of columns 


i m noom 
(vos can remember this table thus: "HE dme ) 


3 -2 0 -1 
Example 1. (G 2) : n >) 


_ (3-0 + (-2)0 er E e E 
“(10440 — L(-D442 
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2 
11 -3 _ (1-24 1-1 + (-3)-1 
Example 2. ( 2 3) ie 2-1 + ta) 
-10 
Example 3. 27 ie j 
35 


~1-6 + 0-(—4) (-1}-2 + 0:2 -6 -2 
-[ 26+7(-422 +72)=(-16 18 
3-6+5-(-4)3-2 +5-2 -2 16 


11-3 51 ; : 
Example 4. (i 2 » (C 2) the product is meaningless 


since the length of the row in the first matrix differs from the 
length of the column in the second one. 


11.6. PROPERTIES OF MATRIX MULTIPLICATION 


1. Noncommutativity. 

It is advisable to compare matrix multiplication and number 
multiplication. 

Matrix multiplication is not commutative in contrast to num- 
ber multiplication. Generally, this means that AB z BA. 

To illustrate, we consider the two matrices 


Since the length of a row in A and the height of a column 
in B are the same, the product AB is true, but the product BA 
is undefined. 

There are pairs if matrices A and B for which both products 
AB and BA are defined, but AB z BA. For instance, 


(5:9) - Go) 
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11\ /01\ _ (02 
00 01 00 
2. Associativity. 
Matrix multiplication is associative. This means that given 


three matrices A, B, and C, if one of the products (AB)C and 
A(BC) exists, then the other also exists and 


(AB)C = A(BC) (1) 


while 


u Suppose A and B are two matrices with the product AB, 
and C is a matrix for which (AB)C is defined: 


Number of rows 


Number of columns 


Then the product A (BC) is also defined. Let us prove equa- 
tion (1). 

We denote the matrix AB by P and BC by Q. Let us prove 
that PC = AQ. 

The element of PC in the ith row and the jth column is 


k k n 
2. PigCaj = p ( 5 diabas) Cgj = Didiabascai (2) 
= =1 «zl 
The subscript œ on the right-hand side of (2) varies from 1 to 
n and 6 varies from 1 to k. 
The element of AQ in the same row and column is 


n n k 
2 disQaj = 2j di © bascas) = J liabagCgj 
a= a=1 = 
The right-hand side of this equation is the same as that in (2). 


Thus, the elements at the same places in the matrices PC and 
AQ are the same. Hence, PC = AQ. s 
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Because matrix multiplication is associative, we can write 
ABC without specifying which product (AB)C or A(BC) is 
meant. 

3. Distributivity. 

Let us prove that matrix multiplication is distributive with 
respect to matrix addition. Since matrix multiplication is not 
commutative, we have 


A(B + C) = AB + AC 
(B + C)JA = BA + CA 


o We only need to prove the first equation since the second 
can be proved by analogy. We assume that B and C have the 
same dimension and the number of columns in A and the num- 
ber of rows in B are the same. 

The element of A (B + C) in the ith row and the jth column is 


ai (bij + Cj) + Gi2(brj + C2j) a S se Gin(Bnj + Cnj) 


where n is the number of columns in A (which equals, by as- 
sumption, the number of rows in B and C). Removing brackets 
yields 


(aiibi + ai Doj + ... + GinDnj) + (aCi) + Gi2Crj 
To... + dinCnj) 


The first sum is the element in the Ath row and jth column 
of AB and the second sum is the analogous element of AC. m 


11.7. INVERSE OF A MATRIX 


A more profound analogy between matrix multiplication and 
number multiplication is observed when considering square 
matrices. We assume that all the matrices discussed below are 
square having the same order n (the number of rows or 
columns), and this means that AB is always defined. 

1. Identity matrix. 

We know that the number 1 is such that when multiplied 
by a number a gives a again: 


al=l-a=a 
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An n X n matrix called the identity matrix with the form 


has a similar property. It is easy to prove that 


AI - IA - A 
whatever the matrix 
011 an Qin 
A=] @1 a ... Mn 
Qn) Am Ann 


o In fact, the element of AL in the ith row and jth column is 
Gi O +- + @ig-1:0 + aye l tagr t... + Gin 0 ai 


whence it follows that AI = A. Similarly, we can prove that 
IA- A. m 

2. Inverse matrix. 

An important property of the multiplication of numbers is 
that any nonzero number a has an inverse number b such that 


ab=1, ba=1 


Matrices have a similar property when the condition that the 
determinant of the matrix A is nonzero is substituted for the 
condition that a # 0. 

Definition 1. A square matrix A is said to be nonsingular 
if its determinant is nonzero: 


JA] # 0 
and the matrix is said to be singular if |A| = 0. 


Definition 2. Given an n X n square matrix A. Ann x n 
matrix B is called the inverse of A if 


AB=I, BA-I 


Ch. 11. Matrices 257 


The inverse of A is usually denoted by A ^!. Thus, 
AA“! =I, A^ClIA-I 


The following proposition is valid (we shall accept it without 
proof): a singular matrix has no inverse. 

3. Finding the inverse of a nonsingular matrix. 

Suppose 


Q1 di Qin 
A=] @ 2 Q2n 
äni an2 ann 


is a nonsingular matrix, i.e. 
|A| #0 


Theorem. /f A is a nonsingular matrix, then 


1 Ay An An 

B =— | Ar Ax An 
BD Uu ee 

Ain Aon Ann 


is the inverse of A. 

Here Aj is the cofactor of aj; in the determinant |A]. 

Note the special arrangement of Aj; in the matrix B: Aj is 
in the jth row and /th column and not in the ith row and jth 
column. 

o We should prove the equations AB = I and BA = I. We 
start with the product AB and putting AB = C we write 


üi —Gdi2 Qin 
C=AB=f] an arn An 
Qni an2 Ann 

l An Án Anı 

x — Ai A22 Arn 

HEN 0 d 

Ain Aon Ann 


17— 366 
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By the rule of matrix multiplication, the element in the ith row 
and jth column in C is 


-e y 
A| 


The expression equals 1 for i = j, since in this case the ex- 
pression in the parentheses is the expansion of the determinant 
|A| by the ith row; when i x j, the equation is zero, since the 
expression in parentheses is the sum of the elements of the ith 
row in [A| multiplied by the cofactors of the corresponding ele- 
ments of the jth row (see property 7°, Sec. 3.6). Thus, cj is unity 
if i= j and is zero if i z j, which implies C = I. m 

Example 1. Given the matrix 


ke (1) 


verify whether it has the inverse and find A~!. 
O We have 


Cij (aii AA + anA Forax F GinÁ jn) 


UM) 


iua ME 


consequently, the matrix A ^! exists. In order to determine it, 
we first find the numbers Aj; = —1, Ai? = —4, Az = —2, 
A22 = —7. Then using (1) we can write 


ica 


Example 2. Given the matrix 
2. 2:3 
A= 1 -1 0 
-1 21 


verify whether it has the inverse and find it. 
O We have 
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Since |A| # 0, A has the inverse A ^ !. 


We have 
An = | i] = Au=- |. |= -! 
-1| _ ze 2:53: (ee 
Ass | 1 JE An = BILE 
ES 23 Ev 32 
Ass | iilos Az = iie 6 
2 = 2:3| _ 
an= | 1o] -* A32 = lio] 23 
ale 2| n 
Ane |? _2| = ~4 
Consequently, the inverse is 
-1 4 3 1 -4 -4 
A'!--[-1 5 3)= 1 -5 -3 
1 -6 —4 -1 6 4 


We advise the reader to prove the equalities AA ^! = I and 
A^"A-ILeO 


11.8. SYSTEMS OF LINEAR EQUATIONS IN MATRIX FORM 


The operation of matrix multiplication opens a new insight 
into a simple, but nevertheless important, problem of linear al- 
gebra, namely, solving a system of linear equations. 

Suppose we have a system of m linear equations in n 
unknowns (an m x n system). For the sake of simplicity, we 
consider the special case when m = 2, n = 3, that is, a system 
of the form 

ae + Q12X2 + aX = bı (1) 
@21X1 + 22%. + 23X3 = bz 


17* 
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By introducing the matrices 


Q11 G12 013 bi z 
A= , B= », X=i Q 
421 A22 Q23 bz 
we can write system (1) as a single matrix equation 


AX=B 2) 


Indeed, the product AX is the matrix with a single column: 


a11Xı + Qy2X2 + Q13X3 
AX = 
21X1 + Q22X2 + 023X3 


whose elements are the right-hand sides of (1). Equating them 
‘to the corresponding elements of the matrix B yields system (1). 
Equation (2) is the matrix notation of system (1). 
For instance, the matrix notation of the system 


— 2x1 + X2 + 5x3 = 4 
X1 —-7x3-21 


-21 5\/u\  f4 
(To 5) = @) 

The matrix notation of an m x n system is still of the form 
in (2), except that A is an m x n matrix, B is the column of 
m elements, and X is the column of n elements. 

1. An n xX n system. Notation of a solution in terms of an 
inverse matrix. 

Systems in which the number of equations and unknowns 
is the same (n X n systems) are of particular importance. In 
this case A is an n X n square matrix. Suppose A is nonsingu- 
lar, ie. its determinant is nonzero. Then A has the inverse A ^ !. 
By using the inverse, we can solve equation (2), namely, mul- 
tiplying (2) from the left by A ^! yields 


A^NAX)-2 A^!B 
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or, because matrix multiplication is associative 
(A "!A)X = A^! B 
But since A ^!A = I and IX = X, we have 
X-A-!B (3) 


We cannot yet say that (3) is equivalent to (2); we can only 
assert that any solution X of (2) also satisfies (3). 
In fact, the matrix X defined by (3) satisfies (2) since 


A(A^!B) = (AA ` 5)B- IB = B 


Thus, (3) is the matrix notation of the solution for an n x n 
system having a nonsingular matrix A. 

Remark. Formula (3) is not so convenient that it greatly sim- 
plifies the solution of an n x n system with a nonsingular 
matrix A since we first have to find the matrix A ^!, which 
is in itself difficult. However, (3) is interesting from the view- 
point of theory. Gaussian elimination is still the best method 
for solving an n x n system. 

Nevertheless, we can illustrate an application of (3). 

Example. Solve the system of equations 


xi + 3x3 = 1 
Sxi + 3x2 + 7x3 = 1 
3x1 + 2x. + 5x3 = 1 


© We have 
103 
A= 50327 
32.5 


The determinant |A| is 4, hence A is nonsingular. 
We calculate the inverse A ^! using formula (1) from 
Sec. 11.6. We have 


3-57 


Au=|5 5 


= 6 


E auc a 
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i] 
an=- [i| = -4 ^u = | i 
ETE 
Ass [33] = 9, Ass - [1i] s 
an= [15] =3 


1/4 6/4 —9/4 
A~! = | —4/4 —4/4 8/4 


1/4 —-2/4 3/4 


Then we find the matrix X from (3) 
1/4 6/4 -9/4 1 —2/4 
X-|-1 -1 2 1|- 0 
1/4 —2/4 3/4 1 2/4 
Thus, the solution of the system is xi = —1/2, x; = 0, 
X3 = 1/2. @ 
2. Matrix equations. 
The unknown matrix X in equations of form (2) consists of 
a single column, however, there are equations of the same form 
in which X consists of several columns. We consider the case 
when A, B, and X are n x n square matrices, if A is nonsingu- 
lar, then in order to find X we may use the technique applied 


earlier, namely, multiplying the equation from the left by the 
matrix A ^!. Then we obtain 


X-A-!B 


The problem is solved. 
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Note that if the equation is of the form 
YA = B 
(the unknown matrix Y is on the left of A), then we obtain 
Y HAS 


Example. Solve the equation 


(34) *= (5) 
a= GÀ) 


is nonsingular (its determinant is —2). We find the inverse 


© The matrix 


and then the unknown matrix 
È 1 4 -2\ /3 5 -1 -1 
X-A^C!B--. - 
x (3 sites) T 5) : 


Exercises to Chapter 11 


1. What can we say about the rows of a matrix if its rank is one or two. 
2. Find the ranks of each matrix: 


T 10 234 
o ( ) (2) | -12 }, 01-120]. (5 
34 35 113 
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4. 322k =l 
(D [2 1-3 2 —5 
po3 0 —r:-2 
lI 3$ 2-216 


3. Find the rank of the matrix depending on the value of a: 


I a S12 alli 
(012-1 a5], @i lai 
1 10 -61 lla 


4. Find the condition under which both products AB and BA and the product 
AA (which is the square of the matrix A) are defined. 


53 
12/7. 
6. Multiply the matrices in the indicated order: 


1 -2\ f21 21\ f1-2 
0 C EJ. © Gale E 


=42 O71 1 4 2 TE 
@il =45 =2 h21 o] 19 zu 
Zsa OF NIT 1 . 4 
1 


j 5:2 
(5) (0, -1) ,€63-2[|17] 012]|(-34 
0 10 3 


5. Find the matrix A? — 6A, where A — 


7. The notation ABC means (AB)C or , which is the same, A(BC). Find 


3 
-13 01Y/-13 01 
(9630) (6). 
1 
0-1 
2](-34 
(c) A 39 (9 ) 


8. The notation A" means A:A...A (n times). Find the matrices 
4 -1V 4 -1M 2 -1M 
1 2.3(2 > GB " 
o ie E) D C E o ic 2) 
2 -1M 2 -1M Way? 
4 » 6 . 6 
T C E 5) ic =) © Go) 
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9. Find the change in the matrix 


ay bi Cc) 
a2 b; C2 
a3 by C3 


100 100 
0], o[01oe], @[0a0 
0 011 001 


10. Find the matrix A ^! if 


2 -1 cos a -sin a -1 0 
a= C E ines ea oa ea ( 0 2) 


M. Find A ^! if 


013 001 11 -1 
(0A4-1|[235], (DA-2 [010], (A-2121 0 
357 100 11 1] 

12. Using the inverse matrix solve each system of equations: 

x+2y+3z=7 2x- yt3z=1 

(a) 42x- y+ z=9 (b) < 3x- Sy + z= 1 

x- 4y +2z= n 4x — Ty =1 


13. Solve each matrix equation 


li 23 1 x30 
(2) [3 2-47-X=f 10 27], 
2 =) 0 10 78 


Chapter 12 


EUCLIDEAN VECTOR SPACES 


The concepts of a basis, dimension, and coordinate in terms 
of a basis were introduced earlier, and are all related to the 
concept of linear dependence. By considering two- and three- 
dimensional vector spaces (i.e. the sets of all vectors in the plane 
or in space), we can see that the concept of a vector is consider- 
ably extended. For instance, each vector has a length, two non- 
zero vectors form a definite angle, and perpendicular vectors 
exist in spaces. This chapter generalizes these concepts (usually 
called metric) to the case of a vector space of any dimension. 


12.1. SCALAR PRODUCT. EUCLIDEAN VECTOR SPACES 


1. Definition of a scalar product. 
We introduced the operation of scalar multiplication of vec- 
tors in the ordinary space in Sec. 2.3 by the formula 
x-y = |x|-|y|-cos e 


where e is the angle between x and y. The consequences of 
this definition are 


Ix| = Vx-x (1) 
cos e TRS (2) 


from which we can see that the length of a vector and the angle 
between two vectors can be expressed via the scalar product. 
Formulas (1) and (2) suggest a way of generalizing (to an n- 
dimensional case) the scalar product, and so the length of a 
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vector and the angle between two vectors can then be found 
from formulas (1) and (2). 

The scalar product of two vectors in three-dimensional space 
possesses the following properties (see Sec. 2.3): 

19. x-x 2 0, with x-x = 0 only if x = 0, 

2°. x-y = y-x, 

39. (kx):y = k(x:y), where k is a scalar, 

49, (x+y)z=xz4+ yz. 

It is advisable that properties 19-4? remain valid for any 
generalization of the scalar product. 

Definition 1. We say that we can define the operation of sca- 
lar multiplication (or simply, scalar multiplication) in a vector 
space L if any pair of vectors x and y from L can be associated 
with a number denoted by x-y and called the scalar product 
of x and y, properties 17-4? being valid. 

Definition 2. A vector space L in which scalar multiplication 
can be defined is called a Euclidean vector space. 

Properties 1-4? are often called the axioms of the scalar 
product. 

2. Method of a scalar product. 

We cannot say from the definition of a scalar product 
whether we can introduce a scalar product in an n-dimensional 
vector space. The formula 


X'y = Xiyi + Xaya + X33 


from Sec. 2.3 suggests an answer; the formula expresses the sca- 
lar product of two vectors x and y in a three-dimensional space 
in terms of the Cartesian coordinates of these vectors. Using 
a similar formula, we can define a scalar multiplication in an 
n-dimensional vector space L. To do this, we choose a basis 
€j, @2, ..., e, In L and associate a pair of vectors x, y with 
a number 


Xy = Xi + Xyz ^... + XnYn (3) 


where xi, X», ..., X, are the coordinates of x and yi, y», ..., 
yn are the coordinates of y in the chosen basis. Then the proper- 
ties 1°-4° are satisfied. 
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o (1) The expression x:x = x? + x3 + ... + x? is nonnega- 
tive for all values of x1, x», ..., Xn and is zero only if xı = 0, 
X220, ..., Xn = 0, that is, when x = 0. 

(2) That property 2? is satisfied is obvious. 

(3) (kx)-y = (kx) yi + (oo)ys  ... + (Kn) Yn = 
K(x1y1 + X272 + ... + XnYn) = k(x:y). 

(4) (+ y)-z = (x + y1) z + (x2 + y2) z2 + ... + (Xa + 
Yn) Zn = Qazi + X272 + ... + XnZn) + Oni + J2Zz2 + ... + 
YnZn) = XZ + y:z. B 

Thus, (3) defines the operation of scalar multiplication in a 
space L. 

If we choose another basis ej, e3, ..., e; and associate a 
pair of vectors x, y with a number (x-y)* = xiyi + Xy» + 
soo XS (where xi, ..., X; and yi, ..., y, are the coor- 
dinates of x and y in the new basis), then, generally speaking, 
X:y = (x-y)* is not satisfied. Hence it is clear that the operation 
of scalar multiplication can be defined in L in many ways. 

We shall show later that however the operation of scalar mul- 
tiplication is defined in a vector space L, there is always at least 
one basis such that (3) is satisfied. 

3. Properties following from the axioms 1°-4° of a scalar 
product. 

To begin with, we show that two additional properties of ax- 
ioms 3° and 4° are valid: 

31. x-(ky) = k(x-y), 

4. x-(y +z) =x-y + xz. 

Property 3? follows from the chain of equalities 


x-(ky) = (ky)-x = k(y-x) = k(x-y) 
each of which uses the axioms of a scalar product. Property 
4° can be proved in a similar manner, viz., 
x (y+ zZ=(¥tzx=yx+tzx=xXytxz 
By using properties 3° and 3? we can deduce one more 
property: 
31 (kx): (y) = (Dy) 
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Further, it follows from properties 4^ and 4? that 


(X1 + Xo... Xx» (yn t yo +... + Yq) 
= Qu, yi) + Gu, Y2) + ... + (Xp, Ya) 
ie. the scalar multiplication of a sum by a sum is distributive: 
each term of the first sum must be multiplied by each term 
of the second and the results added. This and property 39 yield 
a rule for multiplying one linear combination by another: 
(Kix, + KoXxa +... + KpXp, hy: + by2 ... + laya) 
= Kh(xi- yn) + QGb)u-y + ... + kalap Ya) (4) 
For instance 
(3x1 — 2x2) yi + 4y2) 
= (3-5)(xi-y1) + (3-4)(K1-¥2) + ((—2)-5)(x2-y1) 
+ ((—2)-4)(x2-y2) = IS(xi-y1) + 1269-yz) 
— 10(x2-y1) — 8(X2-y2) 


12.2. SIMPLE METRIC CONCEPTS IN EUCLIDEAN VECTOR 
SPACES 


Having introduced scalar multiplication in a vector space L, 
we can define such concepts as the length or absolute value 
of a vector, the angle between two vectors, and the perpen- 
dicularity of vectors. 

Definition 1. The absolute value of a vector x is the number 


ix] = Vx-x (1) 


Since the radicand is nonnegative by virtue of axiom 1°, the 
root exists. 

Definition 2. Suppose x and y are two nonzero vectors. The 
angle between x and y is the number ¢ defined by 


cos gy = po Ge (2) 


Vx-xvy-y 


and such that 0 € o <S m. 
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The existence of ¢ will be proved later. 

Definition 3. Two vectors x and y are said to be perpendicular 
or orthogonal if their scalar product is zero. 

By viritue of (2), two nonzero vectors x and y are orthogonal 
if and only if the angle o between them is «/2 (cos e = 0). 

The zero vector is orthogonal to any other vector and this 
follows from the chain of equalities 


0-x = (0x)-x = 0(x:xx) = 0 


1. Cauchy-Buniakovski's inequality. 

Formula (2) for finding the angle between two nonzero vec- 
tors x and y requires additional explanation. The point is that 
cos e cannot be any arbitrary number since cos e = a is only 
valid if —1s a < 1. Thus, the angle y between two nonzero 
vectors exists if 


-1«. X «1 
x||y| 


or, which is the same, 


(x-y)? «1 (3) 


Ix? ly? ^ 
Theorem. The inequality 
(xy)? < ixl |y]? (4) 


is valid for any two vectors in a Euclidean vector space L. 
Note that (4) is known as Cauchy-Buniakovski's inequality*. 
When x # 0 and y # 0 (4) is equivalent to (3). 
u Consider the scalar product of the vector y — fx with it- 
self, where ¢ is any number: 


(y — tx) (y — tx) = y-y - ty) — ty) Cx) 


* Cauchy A. L. (1789-1857), French mathematician, the author of fun- 
damental works on mathematical analysis 

Buniakovski V. J. (1804-1889), Russian mathematician, the author of many 

works on statistics including the first Russian textbook on probability theory. 


| 
| 
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We obtain an equation of the form 

(y — tx)-(y — tx) = af? + 28t- y 
where a = x-x, 8 = —(x-y), y = y-y. The quadratic polynomi- 
al in ¢ on the right-hand side of the equation is nonnegative 
for any f, since it is the scalar product of a vector with itself. 


Hence it follows that the discriminant of the quadratic, ie. 
B? — ay, is not positive. Consequently, 


(x-y)? — (x-x)(y-y) «0 


which is (4) in different notation. m 

Thus, we have proved Cauchy-Buniakovski's inequality and 
thereby validated the definition of the angle between two 
vectors. 

If the scalar product is defined in terms of a basis by x-y = 
Xii + X22 +... + XnYn, then  Cauchy-Buniakovski's ine- 
quality assumes the form 


ayi 43)» +... + Xn)” 
< 2 2 2 2 2 2 
€ (xi +x +... Xf + Y e.c yn 


which is valid for all x and y. 


12.3. ORTHOGONAL SYSTEM OF VECTORS. ORTHOGONAL 
BASIS ° 


1. Orthogonal system of vectors. 
Definition. The system of vectors 
81, 32, ..., Ap (1) 
in a Euclidean vector space L is said to be orthogonal if any 
two vectors of the system are orthogonal, that is, 
aaj = 0 

where i and j are any numbers from the set 1, 2, ..., p and 
ix]. 

Obviously, any subsystem of an orthogonal system is also 
orthogonal. 
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What is then the number of vectors in an orthogonal system? 
If we do not require that each vector of the system be nonzero, 
then the system may consist of any number of vectors. For in- 
stance, the system of p equal vectors 0, 0, ..., 0 is orthogonal 
for any p. Therefore, we assume in addition, that all the vectors 
of the system are nonzero. Given this assumption, the number 
of vectors must not exceed the dimension of the space L, i.e. n. 

This follows from the next Lemma. 

Lemma. An orthogonal system of nonzero vectors is always 
linearly independent. 

DU Given an orthogonal system of nonzero vectors (1). Sup- 
pose they are linearly dependent, i.e. the following equality is 
true, 

cia t cat... + Cpa = 0 


where ci, C2, ..., Cp are not all zero, say ci # 0. Then scalar 
multiplication of the last equation by a; yields 


C, (ara) + c6(22:81) +... + Cp(3p:21) =0 


or, since a; is orthogonal to every vector a2, ..., ap, 
Ci (231-21) =0 


But a-a; # 0 since a; z 0. Hence c; = 0. This contradiction 
proves that system (1) is linearly independent. m 

Since a linearly independent system in an n-dimensional vec- 
tor space consists of not more than 7 vectors, the lemma yields 
the corollary: A system of mutually orthogonal nonzero vectors 
in an n-dimensional Euclidean vector space contains no more 
than n vectors. 

This proposition is obvious for n = 3, since we cannot “put” 
more than three orthogonal nonzero vectors into an ordinary 
three-dimensional space (Fig. 118). 


Figure 118 
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Is there an orthogonal system of exactly n nonzero vectors 
in an n-dimensional Euclidean vector space L? Since, according 
to the lemma such a system is necessarily a basis of the space, 
we may put an alternative question: Is there an orthogonal basis 
in L? The answer is given in the next subsection. 

2. Existence of an orthogonal basis. 

Lemma. Suppose we have a system of k vectors in an n- 
dimensional Euclidean vector space L, k « n, then there exists 
a nonzero vector orthogonal to all the vectors of the system. 

a We denote the vectors of the system by a;, a2, ..., Ak 
and the unknown nonzero vector by x. The vector x z 0 and 
is such that 


arx=0, a-x=0,..., ag-x=0 (2) 
We choose a basis ei, e2, ..., e, in L and assume that 
X = Xe; + Xe + see + Xnen 


Then conditions in (2) take the form 


8i:(Xx1ei + X2e2 + ... + Xren) = 0 
a2-(x1e1 + X262 +... + Xe) = 0 
ag: (nei + X2€2 + ... + Xnen) = 0 


Expanding each scalar product and designating oj; = a;-e; 
yields 


das + 01232 +... t OinXn = 0 
O21X| + 022X2 + ... + AnXn = 0 

Ls + Ok2X2 o... + OknXn = O 
This is a. system of k homogeneous equations in n unknowns 
Xis X2, -a.s Xn. Since k < n, the system must have a nontrivial 
solution (see the theorem in Sec. 3.9). Consequently, there is 
a nonzero vector x satisfying (2). m 


Theorem. There is always a basis of mutually orthogonal 
vectors in a Euclidean vector space L (an orthogonal basis). 
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a Let L be n-dimensional. In order to construct an or- 
thogonal basis, we take an arbitrary nonzero vector a;, then 
a nonzero vector az orthogonal to aj, then a nonzero vector 
33 orthogonal to all the constructed vectors a1, a2, ..., @n—1. 
The existence of each next vector is ensured by the previous 
lemma. The resulting system a1, a2, ..., a4 is orthogonal con- 
sisting of n nonzero vectors and, consequently, is a basis of 
L. E 


12.4. ORTHONORMAL BASIS 


1. Definition of an orthonormal basis. 

Definition 1. A vector e is said to be normalized or a unit 
vector if its length is 1. 

Multiplying a nonnormalized vector a (a ¥ 0) by a number 


EM yields the normalized vector; indeed, a scalar product of the 


vector L a with itself is 
a 
1 ) ( 1 ) l 
—a}-(—a} =— (aa) = 
G la| [a 
Transition from the vector a to a a is called the normaliza- 
a 


tion of a. 
Definition 2. Given an n-dimensional Euclidean vector space 
L, the basis 
Elo 292 Bs (1) 


of L is said to be orthonormal if 
(1) the basis is orthogonal, that is, 


erejz0 for izj (2) 
(2) each basis vector is normalized, that is, 
erg = 1 (3) 


An orthonormal basis always exists. In order to form such 
a basis, it is sufficient to take any orthogonal basis a, 382, .. 
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an (the existence of an orthogonal basis was proved in the previ- 
ous section) and then normalize each vector of the basis 


RT 
an| 
2. Scalar product in an orthonormal basis. 

Suppose we have an orthonormal basis ei, €, ..., €n. Per- 
forming scalar multiplication of two arbitrary vectors 
X = X1€&i + KO. +... + Xmen and y= yie + ye +... + 
Yn€n yields 


1 
e = — a, €2 = — 82, ..., € = an 
a la2| 


X:y = (Qnae): y + (x2€2)-y + ... + (Xn€n):y (4) 
The first term of the right-hand side equals 
(161) 0161)  63e)-0262) + ... + (x161): Ones) 
Gay (ei-ei) + 60y3)(ei-e2) +... + (X1Yn)(€1*€n) 
= xyyrl + xy0 9... + xy = xy 


Similarly, we find that the second term is x2y2, and so on. The 
result is 


X:y = Xiyi + X22 c... + Xnyn 


Thus, the scalar product of two vectors in an orthonormal 
basis equals the sum of the products of their respective coor- 
dinates. 


Exercises to Chapter 12 


1. Explain why it is impossible to introduce a scalar product in an z- 
dimensional vector space, with n > 1, using the formula 


xy = xy 


where x; and y; are the first coordinates of x and y in the basis. Which axiom 
of 1°-4° of the scalar product appears to be violated? 
2. Prove the inequality 


Gd + DOR + »2 2 Cay + xy 


(a particular case of Cauchy-Buniakovski’s inequality for n = 2) indirectly by 
considering the difference between the left-hand and right-hand sides. 


18* 
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3. Prove that the previous inequality becomes an equality if and only if xi: 
yi = X2ty2, that is, when the vectors (xi, x2) and (yi, yz) are collinear. 
4. Let a scalar product be defined in R? by 


xy = Xiy + X2y2 + Xsys 


Employing the method given in Sec. 12.3, form an orthogonal basis with the 
first basis vector (1, 1, 1). 
4. Suppose a scalar product in R^ is defined by 


xy = Xiyi + Xoya + Xsys + Xaya 


(a) Verify the orthogonality of the vectors a; = (1, 1, 1, 1) and az = (l, 
1, -1, —1». Construct an orthogonal basis which includes a; and az. Can 
we choose the other two vectors so that their coordinates are +1 or —1? 

(b) Find and normalize a nonzero vector orthogonal to each of the vectors 
41, 1, 1, 1), (l, —1, -1, 1), (2, 1, 1, 3). 


Chapter 13 


AFFINE SPACES. CONVEX SETS AND POLYHEDRONS 


13.1 THE AFFINE SPACE A" 


We noted that an arithmetic vector space R” is an important 
object studied in linear algebra. An arithmetic affine space A" 
is closely related to R". 

The relation between A" and R” is the same as between the 
set of all points and the set of all vectors in an ordinary three- 
dimensional space. Each point in an ordinary space is defined 
(in a given Cartesian coordinate system) by a triple of numbers 
and a pair of points defines a vector. By analogy, the points 
of A" are defined by sets of n numbers and each pair of points 
is associated with a vector from R". 

Definition 1. Any set of n numbers 


Gi, Hi. ..., An (1) 


is called an arithmetic point and the numbers ai, a2, ..., Qn 
are called the coordinates of that point. 

To stress that (1) is considered to be an arithmetic point (and 
not an arithmetic vector) we write it in parentheses: 


(ai, a2, eee an) (2) 


We denote arithmetic points by upper-case letters A, B, C, 
and so on. For instance, we denote point (2) by A and write 
A = (ti, G2, ..-, Qn). 

Definition 2. Suppose A and B are two arithmetic points hav- 
ing the same number of coordinates n, ie. A = (a1, @2, ..., 
Qn) and B = (bi, b2, ..., bn). The arithmetic vector (bi, — a, 
b2 — G2, ..., Dn — Gn) from R” is called the vector AB and 
A is its initial point and B is its terminal point. 

This definition implies that the coordinates of AB are the 
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differences of the corresponding coordinates of the initial and 
terminal points. 

The point (0, 0, 0) is called the origin and denoted by O. 
Obviously, whatever the point A, the coordinates of OA coin- 
cide with the coordinates of A. The vector OA is the radius 
vector of A. 

Definition 3. The set of all arithmetic points having n coor- 
dinates in which each pair of points A and B is associated (by 
the technique discussed) with a vector AB from R" is called 
an n-dimensional arithmetic affine space and denoted by A". 

We shall frequently omit the word "arithmetic" in our later 
discussion. 

Theorem. The equality 


AB + BC = AC 
is valid whatever the points A, B, C from A". 


a Let A = (à, à, ..., Qn), B= (bi, b2, ..., bn), and 
C = (Cis, 5. c). Then 
AB = (b — a1, b2 — @2, ..., bn — Qn) 


BC = (à — bi, e — bz, ..., Cn — by) 
whence it follows that 


AB + BC = (bi — a) + (c1 — bi), (b2 — a) + (6 — b2), 
— (bn — an) + (Cn — bs) 
= (C1 — 41, C2 — d», ..., Cn — An) - AC a 

Laying off a vector. This is an important concept in the ge- 
ometry of A” or affine geometry. 

Definition 4. Let A = (ai, a2, ..., Qn) be a point from A" 
and p = (pi, p», ..., Pn) be a vector from R". To lay off a 
vector p from a point A is to find a point B such that 


AB =p (3) 


_ Since OB = OA + AB, (3) is equivalent to the equation 
OB = OA + p. Consequently, denoting the coordinates of B 
by bi, b2, ..., bn yields 

bh =a + pi, b2 = @ + pr, ..., Dn = An + Dn (4) 
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which means that the coordinates of B are the sums of the coor- 
dinates of A and the corresponding coordinates of p. 
Notation. We write the point B obtained by laying off a vec- 
tor p from the point A as A + p. 
According to (4), 


A +p= (a + pi, @ + Pr, ..., Gn + Dn) 


13.2. SIMPLE GEOMETRIC FIGURES IN A" 


1. Straight line 

A straight line in an ordinary space can be defined by a point 
and a direction vector. The following definition is the generali- 
zation of this fact. 

Definition. Let Xo be a fixed point in A" and p be a fixed 
vector from R”. The set of points X of the form 


X - Xo ^ tp () 


with ¢ a number, is called a straight line through X» in the direc- 
tion of p, or simply, a straight line. 

If Xo = (xf, xf, ..., x9) and p = (pi, P2, ..., pn), then 
x? + tp, x8 + tp2, ..., X? + tps are the coordinates of point 
(1). Therefore, we can say that the parametric equations ofa 
straight line through Xo in the direction of p have the form 


X= Xbt tpi, X) = XD + tpo, ..., Xa XB + fps 


2. Line segment. 
Definition. Let Xo and X1 be two points in A". The set of 
points X of the form 


X= Xo + tXoXi (2) 
with ¢ varying over the interval 0 < ¢ < 1 is called a line seg- 
ment XgX,. 


The points Xo and X, are called the end points of the line 
segment XoX1. Obviously Xo corresponds to ¢ = 0 and X; to 
i-a 

Theorem (on a line segment). A line segment XoX, in A" 
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consists of the points X such that 
OX = s0X, + (1-5 OX; 


where s varies within the limits 0 « s < 1. 
o It follows from (2) that 


OX = OX) + tXoXi = OX, + t(OX, - OX) 
= (1 — t) OX, + tOX, 


Putting 1 — ¢ = s yields the statement of the theorem. m 

3. Different kinds of plane in A". 

We can introduce the concept of a plane, and not only a 
straight line, in the space A". However, the dimension n may . 
be larger than three. The point is that there are different kinds 
of plane in A", where m 3: two-dimensional, three- 
dimensional, and finally (n — 1)-dimensional. 

Definition. Let k be a real number. A plane of dimension 
k (or a k-dimensional plane) in A" is the set of points X of 
the form 


X= Xo + tipi + fpa t+... + tepe 


where Xo is a fixed point, pi, pz, ..., px are fixed linearly in- 
dependent vectors from R”, and fi, t2, ..., fy are arbitrary 
numbers. 

The definition may include the condition that k must be 
smaller than n. Indeed, there are no linearly independent sys- 
tems of vectors whose number exceeds n in R”. The case when 
k = n is of no interest since a linearly independent system of 
n vectors is a basis of R", and, therefore, an n-dimensional 
plane is the set of points of the form 


Xo tp 


where p is any vector from R”; in other words, an n- 
dimensional plane is the entire space A". 

Two kinds of plane in A" are most interesting, namely, planes 
of minimal possible dimension 1 and planes of maximal possi- 
ble dimension n — 1. One-dimensional planes are, essentially, 
straight lines defined earlier. Planes of dimension n — | are 
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0 
Figure 119 


called hyperplanes. When n = 3, a hyperplane is the set of 
points of the form 


Xo + tpi + pz 


where pi and p; are linearly independent vectors from R°. 
Whence it follows that for n = 3 the concept of a hyperplane 
coincides with that of a plane in a three-dimensional space 
(Fig. 119). 

An ordinary plane is defined by an equation of the form 


ax t by+cz+d=0 


Generalizing this fact, we shall give, in the next subsection, 
another, independent definition of a hyperplane and use it in 
what follows. 

4. Hyperplanes. 

Definition. A hyperplane in A" is the set of numbers 
X= (xu, X2, ..., Xn) whose coordinates satisfy the linear 
equation 


axı + G2X2 + 2... + dX. +b -0 (3) 


where a1, 42, ..., Gn, b are fixed numbers and a1, @2, ..., Qn 
are not all zero. 

Both definitions of a hyperplane are equivalent in that we 
define it as a set determined by an equation of form (3) and 
as an (n — 1)-dimensional plane, i.e. the set of points X of the 
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form 
X = Xo + tipi + po +... + Ín - 1Bn - 1 


where pi, P2, ..., Da- 1 are fixed linearly independent vectors 
from R", and the equivalence can be proved. 

5. Half-spaces. 

If a plane in an ordinary space is defined by the equation 


ax + by+cz+d=0 


then each of two half-spaces into which the plane divides the 
space is defined by the respective inequality, namely, 


ax+ by+ecz=d20 
and 
ax+by+cz~+d<0 
Generalizing this fact we make the following definition. 
Definition. Given a hyperplane 
diXi + 2X2 c... + dX. +O =0 


in A”, the following two sets are called half-spaces into which 
this hyperplane divides A”: 


(1) the set of all points X = (xi, x2, ..., Xn) such that 
aX + aX +... + QnXn t+ b 20 

(2) the set of all points X = (xi, X2, ..., Xn) such that 
@X1 + d3X2 to... + OnXn +O SO 


13.3. CONVEX SETS OF POINTS IN A", 
CONVEX POLYHEDRONS 


1. Convex sets in the plane and in three-dimensional space. 

Definition. The set M of points in the plane and in space 
is said to be convex if it contains the line segment AB joining 
its two points A and B. 

Figure 120 illustrates convex and concave sets. 
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Figure 120 


2. Convex sets in A”. 

Definition. The set M of points in A” is said to be convex 
if it contains all the points of the line segment AB joining its 
points A and B. 

Theorem. The hyperplane as well as any of its half-spaces 
into which it divides A" are convex sets. 

Hu We consider a half-space defined by 


diXi + 2X2 +... + dX b 20 (1) 


We shall prove that it is convex. Suppose the points X’ = (xí, 
X2, ..., Xn) and X" = (xf, xz, ..., x4) belong to the indicated 
half-space, i.e. 
aixi + axi +... + anx + b m0. 
MX + dax; e... + nxt bZ0 


(2) 


Let us verify that any point X of the line segment X' X" also 
belongs to that half-space. 

According to the theorem on a line segment (see Sec. 13.2), 
the coordinates of X can be represented as 


xı = sxi + (l —-s)x/ 
= sx; + (1 - s) xý 


Xn = SX, + (l - S) x7 


& 
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Figure 121 


where 0 < s € 1. Substituting these expressions into the left- 
hand side of (1) yields 


ai(sxi + (1 — s) xy) + ax(sxz + (1 — s) xz) 
+... + an(5x_n + 1 -— 5) x) +b 
= s(aix{ + axi + ... + anXa + D) 
+ (0 — s(aixi + ax? + ... + anx + b) 


(note that sb + (1 — s) b = b). The expression on the right- 
hand side of this equation is nonnegative on the basis of (2) 
and since s2 0, 1-—s2>0. 

Thus, we proved that the half-space is convex. The same can 
be proved by analogy for a hyperplane. m 

3. Convex polyhedral domains. Convex polyhedrons. 

A convex polyhedron in an ordinary space can be visualized 
as an intersection of several half-spaces. For instance, we can 
see from Figure 121 that a tetrahedron is the intersection of 
four half-spaces and a cube is the intersection of six half- 
spaces. 

However, the converse is not always true: the intersection of 
several half-spaces can be a convex polyhedron. First, the inter- 
section may be empty (Fig. 122) or it can be an unbounded 
figure (the infinite trihedral angle in Fig. 123). Given the last 
remark, we introduce the following definitions. 

Definition 1. The set of points in A" is called a convex poly- 
hedral domain if it can be represented as the intersection of 
several half-spaces. 

Definition 2. A convex polyhedral domain in A", which is 
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Figure 122 Figure 123 
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at the same time a bounded set in A", is called a convex poly- 
hedron. 

In a bounded set the coordinates of all its points are less 
in absolute value than a constant C: |xi| € C, kl S C, ..., 
[xa] « C. 

Thus, a convex polyhedral domain is defined by a system of 
linear inequalities 


MX, + 2X2 + ... + GnXn - b 20 
djXi + aX% + ... + QnXn + b' 20 


The number of inequalities in the system can be arbitrary. 
Example. Find the convex polyhedral domain in the plane 
defined by the system 


xt+y+120 
x—-2»y -220 
2x -y -420 


(In the case of a plane, we should say a polygonal and not 
“polyhedral” domain.) 

@ We can write the first inequality of the system as 
y 2 —x — 1; it defines the half-plane above its boundary line 
A (Fig. 124). The second inequality is equivalent to 
y S (1/2)x — 1 and defines the half-plane below the boundary 


286 Part Two. Linear Aigebra 


Figure 124 


line h. The third inequality y < 2x — 4 defines the half-plane 
below its boundary line /;. The intersection of these three half- 
planes is a convex polyhedral domain M shown in Fig. 124; it 
is unbounded and has two vertices: Ai(1, —2) and A2(2, 0). 6 


Exercises to Chapter 13 


1. Prove that if A, B, C, D are four points in a space A”, with AB = CD. 
then AC = BD. 

2. Prove that (A + p) + q = A + (p + q), where A is any point from A” 
and p and q are vectors from R”. 

3. Prove that two distinct straight lines in space A” can only have a single 
point in common. 

4. Prove that three points A, B, C in a space A" lie on the same straight 
line if and only if the vectors AB and AC are collinear. 

_5. A point x, in A" divides the line segment X)X2 in the ratio «:@ if 
XX = (a/8) XX. Deduce the formula 


OX = 


OX, it pu OX. 2 
a+ Bp act 
for the radius vector of point X dividing the line segment in the ratio a:f. 
6. Prove that any three noncollinear points A, B, C in space A" belong to 
a two-dimensional plane and that this plane is unique. 
7. Find which of the indicated domains on the plane are convex: (1) y > x’, 


29 2 
Q»«x, Q xsy«V à x«x!«s (9 1 772^ 


x21, 
6) ?-y <1. 
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8. Let M be a domain in the plane and M be its complement (to a complete 
plane). Can it be that the domains M and M are convex? 

9. Find the convex polygonal domains in the plane defined by each system 
of inequalities: 


x-y+I120 x+y-120 
(a) 3x + 2y-620 (b) x-y+120 
—3x-y+920 —-x+y+120 
2x+y 20 

2x-y+7<0 

—-4x+ y+ 1130 

(c) -x+ 3y-7<0 

x-2<¢0 

x-y-5<¢0 


Which of the domains are convex polyhedrons? 
10. Suppose a convex polyhedral domain M is defined in the plane by 


axt+hy+a 20 
ax+hyt+oard0 


Prove that if M contains a ray with the direction vector p = (X, Y), then 


aX+hY20 
mX+hY20 


Formulate a similar problem for a three-dimensional space. 
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